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to Nancy 


PREFACE 


Once upon a time students of mathematics and students of science or 
engineering took the same courses in mathematical analysis beyond calculus. 
Now it is common to separate “‘advanced mathematics for science and engi- 
neering”? from what might be called “‘advanced mathematical analysis for 
mathematicians.”’ It seems to me both useful and timely to attempt a 
reconciliation. 

The separation between kinds of courses has unhealthy effects. Mathe- 
matics students reverse the historical development of analysis, learning the 
unifying abstractions first and the examples later (if ever). Science students 
learn the examples as taught generations ago, missing modern insights. A 
choice between encountering Fourier series as a minor instance of the repre- 
sentation theory of Banach algebras, and encountering Fourier series in 
isolation and developed in an ad hoc manner, is no choice at all. 

It is easy to recognize these problems, but less easy to counter the legiti- 
mate pressures which have led to a separation. Modern mathematics has 
broadened our perspectives by abstraction and bold generalization, while 
developing techniques which can treat classical theories in a definitive way. 
On the other hand, the applier of mathematics has continued to need a variety 
of definite tools and has not had the time to acquire the broadest and most 
definitive grasp—to learn necessary and sufficient conditions when simple 
sufficient conditions will serve, or to learn the general framework encompass- 
ing different examples. 

This book is based on two premises. First, the ideas and methods of the 
theory of distributions lead to formulations of classical theories which are 
satisfying and complete mathematically, and which at the same time provide 
the most useful viewpoint for applications. Second, mathematics and science 
students alike can profit from an approach which treats the particular in a 
careful, complete, and modern way, and which treats the general as obtained 
by abstraction for the purpose of illuminating the basic structure exemplified 
in the particular. As an example, the basic L? theory of Fourier series can be 
established quickly and with no mention of measure theory once L7(0, 27) is 
known to be complete. Here L7(0, 277) is viewed as a subspace of the space of 
periodic distributions and is shown to be a Hilbert space. This leads to a dis- 
cussion of abstract Hilbert space and orthogonal expansions. It is easy to 
derive necessary and sufficient conditions that a formal trigonometric series 
be the Fourier series of a distribution, an L? distribution, or a smooth 
function. This in turn facilitates a discussion of smooth solutions and distri- 
bution solutions of the wave and heat equations. 

The book is organized as follows. The first two chapters provide back- 
ground material which many readers may profitably skim or skip. Chapters 
3, 4, and 5 treat periodic functions and distributions, Fourier series, and 
applications. Included are convolution and approximation (including the 
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Weierstrass theorems), characterization of periodic distributions, elements of 
Hilbert space theory, and the classical problems of mathematical physics. The 
basic theory of functions of a complex variable is taken up in Chapter 6. 
Chapter 7 treats the Laplace transform from a distribution-theoretic point of 
view and includes applications to ordinary differential equations. Chapters 6 
and 7 are virtually independent of the preceding three chapters; a quick 
reading of sections 2, 3, and 5 of Chapter 3 may help motivate the procedure 
of Chapter 7. 

I am indebted to Max Jodeit and Paul Sally for lively discussions of what 
and how analysts should learn, to Nancy for her support throughout, and 
particularly to Fred Flowers for his excellent handling of the manuscript. 


Richard Beals 
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Chapter I 


Basic Concepts 


§1. Sets and functions 


One feature of modern mathematics is the use of abstract concepts to 
provide a language and a unifying framework for theories encompassing 
numerous special cases and examples. Two important examples of such 
concepts, that of “‘metric space” and that of “‘vector space,” will be taken up 
later in this chapter. In this section we discuss briefly the concepts, even more 
basic, of “‘set’? and of “function.” 

We assume that the intuitive notion of a “set”? and of an “‘element”’ of a 
set are familiar. A set is determined when its elements are specified in some 
manner. The exact manner of specification 1s irrelevant, provided the elements 
are the same. Thus 


A = {3, 5, 7} 
means that A is the set with three elements, the integers 3, 5, and 7. This is the 
same as 


A= {7, 3, S}; 
Or 


A = {n| nis an odd positive integer between 2 and 8} 


or 
A = {2n+ 1|n = 1, 2, 3}. 


In expressions such as the last two, the phrase after the vertical line is sup- 
posed to prescribe exactly what precedes the vertical line, thus prescribing 
the set. It is convenient to allow repetitions; thus A above is also 


{5, 3, 7, 3, 3}, 
still a set with three elements. If x is an element of A we write 
xeEA or AdxX. 
If x is not an element of A we write 
x@A or APxX. 


The sets of all integers and of all positive integers are denoted by Z and 
Z., respectively: 
Z = {0, 1, —1, 2, —2, 3, —3,...}, 
Z, = {1, 2, 3,4,...}. 


As usual the three dots ... indicate a presumed understanding about what 
is omitted. 


2 Basic concepts 


Other matters of notation: 


@ denotes the empty set (no elements). 
A U B denotes the union, {x | x € A or x € B (or both)}. 
AQ B denotes the intersection, {x | x € A and x é€ B}. 


The union of A;, Ag,..., A, is denoted by 
A,UAgU AgU ++» U Am or () Ay, 
j=l 


and the intersection by 


Ay AgN Ag +++ AAn or () Aj 
j=l 


The union and the intersection of an infinite family of sets 4,, 42... indexed 
by Z, are denoted by 


LJ A, and C) A, 
j=l j=l 


More generally, suppose J is a set, and suppose that for each j €J we are 
given a set A,;. The union and intersection of all the A, are denoted by 


) A; and () Aj. 
jeJ jeJ 
A set A is a subset of a set Bif every element of A is an element of B; we 


write 
ACB or BOA, 


In particular, for any A we have @ © A. If A < B, the complement of A in B 
is the set of elements of B notin A: 


B\A = {x|xeB,x ¢ A}. 
Thus C = B\A is equivalent to the two conditions 
AUC=B, ANC=26. 


The product of two sets A and B is the set of ordered pairs (x, y) where 
xeAand ye B; this is written A x B. More generally, if A,, Ag,..., A, are 
the sets then 


A, X Ag X-:-X A, 
is the set whose elements are all the ordered n-tuples (x;, x2,..., X,), where 
each x; € A;. The product 
AxAx--+-xXA 
of n copies of A is also written A”. 


A function from a set A to a set B is an assignment, to each element of A, 
of some unique element of B. We write 


f:A->B 
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for a function f from A to B. If x € A, then f(x) denotes the element of B 
assigned by f to the element x. The elements assigned by f are often called 
values. Thus a real-valued function on A is a function f: A > R, R the set of 
real numbers. A complex-valued function on A is a function f: A + C, C the 
set of complex numbers. 

A function f: A — B is said to be I-1 (“one-to-one”) or injective if it 
assigns distinct elements of B to distinct elements of A: If x, ye A and 
x # y, then f(x) # f(y). A function f: A — B is said to be onto or surjective 
if for each element y € B, there is some x € A such that f(x) = y. A function 
f: A —> B which is both 1-1 and onto is said to be bijective. 

If f: A— B and g: BC, the composition of f and g is the function 
denoted by g °/: 


gof:A->c,  gof(x)=s(f/(s)), forall xeA. 


If f: A — Bis bijective, there is a unique inverse function f~+: B-—> A with the 
properties: f~1 0 f(x) = x, for all xe A; fof-*(y) = y, for all ye B. 


Examples 


Consider the functions f: Z—>Z,, g:Z>2Z, h:Z—Z, defined by 


f(n) = nr? +1, neZ, 
g(n) = 2n, neZ, 
h(n) = 1 — 4, neZ. 


Then fis neither 1-1 nor onto, g is 1-1 but not onto, h is bijective, h~*(n) = 
1 — n, and fo h(n) = n? — 2n + 2. 


A set A is said to be finite if either A = @ or there is anneZ,, anda 
bijective function f from A to the set {1, 2,...,}. The set A is said to be 
countable if there is a bijective f: A — Z,. This is equivalent to requiring that 
there be a bijective g: Z, —> A (since if such an fexists, we can take g = f7’; 
if such a g exists, take f= g~+). The following elementary criterion is 
convenient. 


Proposition 1.1. Jf there is a surjective (onto) function f: Z, —> A, then A 
is either finite or countable. 


Proof. Suppose A is not finite. Define g:Z,—->2Z, as follows. Let 
g(1) = 1. Since A is not finite, A 4 {f(1)}. Let g(2) be the first integer m such 
that f(m) # f(1). Having defined g(1), g(2),..., g(n), let g(n + 1) be the first 
integer m such that f(n + 1) ¢ {f(1), f(2),..., f(m}. The function g defined 
inductively on all of Z, in this way has the property that fog: Z,— A 1s 
bijective. In fact, it is 1-1 by the construction. It is onto because fis onto and 
by the construction, for each n the set {f(1), f(2),...,f(m)} is a subset of 


{fo g(1), fo g(2),...,feg(n)}. O 


Corollary 1.2. If B is countable and A < B, then A is finite or countable. 
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Proof. If A = @, we are done. Otherwise, choose a function f: Z, — A 
which is onto. Choose an element x € A. Define g: Z, — A by: g(n) = f(n) 
if f(n) € A, g(n) = Xo if f(n) ¢ A. Then g is onto, so A is finite or countable. [ 


Proposition 1.3. Jf A,, Ao, A3,... are finite or countable, then the sets 
n fe] 
|) A; and |)A, 
j=1 j=l 


are finite or countable. 


Proof. We shall prove only the second statement. If any of the A; are 
empty, we may exclude them and renumber. Consider only the second case. 
For each A; we can choose a surjective function f;: Z, — A,. Define f: Z, > 
Uji A; by f) = AC), £3) = AQ), £5) =AG),.-..f2 =A), £6 = 
F.(2), f(10) = f,(3),..., and in general f(2’(2k — 1)) = fk), j, k = 1,2, 
3,.... Any x € UF, A; is in some A,, and therefore there is k € Z, such that 
Fk) = x. Then f(2’(2k — 1)) = x, so fis onto. By Proposition 1.1, 7.1 A; 
is finite or countable. {] 


Example 


Let @ be the set of rational numbers: Q = {m/n | meZ,neZ,}. This is 
countable. In fact, let A, = {j/n|j¢Z, —n? <j < n*}. Then each A, is 
finite, and @ = (7., Ay. 


Proposition 1.4. If A;, Ag,..., A, are countable sets, then the product set 
A, X Ag X --: X A, is countable. 


Proof. Choose bijective functions f;: A; > Z,, 7 = 1,2,...,n. For each 
meZ., let B, be the subset of the product set consisting of all n-tuples 
(X1, X2,..-,X,) such that each f,(x;) < m. Then B,, is finite (it has m” ele- 
ments) and the product set is the union of the sets B,,. Proposition 1.3 gives 
the desired conclusion. [ 


A sequence in a set A is a collection of elements of A, not necessarily 
distinct, indexed by some countable set J. Usually J is taken to be Z, or 
Z, U {0}, and we use the notations 


(Qn)n=1 = (a, a2, 43, .- )s 
(Qn)n=0 = (ao, a, Qo5. .). 


Proposition 1.5. The set S of all sequences in the set {O, 1} is neither finite 
nor countable. 


Proof. Suppose f: Z, — A. We shall show that fis not surjective. For 
each meZ,, f(m) is a sequence (@y.m)n=1 = (Gi,m> Ge,m>---), Where each 
Qn.m 1S 0 or 1. Define a sequence (a,)y-1 by setting a, = Oifa,, = 1, a, = 1 
if dy, = 0. Then for each me Z,, (Qy)n=1 F (GQn.m)na1 = f(m). Thus fis not 
surjective. [ 
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We introduce some more items of notation. The symbol = means 
““implies’’; the symbol < means “is implied by”’; the symbol <> means “‘is 
equivalent to.” 

Anticipating §2 somewhat, we introduce the notation for intervals in the 
set R of real numbers. If a, be R anda < b, then 


(a,b) = {x | xeR,a < x < dB}, 
(a, b] = {x |xeR,a < x < B},” 
[a,b) = {x|xeR,a <x < B}, 
[a,b] = {x |xeR,a<x < 5B}. 


Also, 


(a, 00) = {x |xeR,a < x}, 
(—o, a] = {x |xeR, x < a}, ete. 


§2. Real and complex numbers 


We denote by R the set of all real numbers. The operations of addition 
and multiplication can be thought of as functions from the product set 
R x R to R. Addition assigns to the ordered pair (x, y) an element of R 
denoted by x + y; multiplication assigns an element of R denoted by xy. 
The algebraic properties of these functions are familiar. 


Axioms of addition 


Al. (x + y)+z2=x+4+(y9 + 2), for any x, y,zER. 

A2.x+y=y+4+-X, for any x,yeR. 

A3. There is an element 0 in R such that x + 0 = x for every x ER. 
A4. For each x € R there is an element —x € R such that x + (—x) = 0. 


Note that the element 0 is unique. In fact, if 0’ is an element such that 
x + 0’ = x for every x, then 


0'=0'+0=040' =0. 
Also, given x the element — x is unique. In fact, if x + y = 0, then 


y=yt+O=yt+ (e+ (-x)) = (9 + x) + (-*) 
= (x+y) + (-x) = 0+ (-x) = (-x) + 0 = —x. 


This uniqueness implies —(—x) = x, since (—x) + x = x + (—x) = 0. 


Axioms of multiplication 


M1. (xy)z = x(yz), for any x, y,zER. 

M2. xy = yx, for any x, yER. 

M3. There is an element 1 4 0 in R such that x1 = x for any xeER. 

M4. For each xe R, x 4 0, there is an element x~? in R such that 
xx7+ = 1. 
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Note that 1 and x7? are unique. We leave the proofs as an exercise. 


Distributive law 
DL. x(y + Zz) = xy + xz, for any x, y,zER. 


Note that DL and A2 imply (x + y)z = xz + yz. 
We can now readily deduce some other well-known facts. For example, 


0-x = (0+ 0)-x = 0-x + 0-x, 
so 0-x = 0. Then 
x+(-1)-x =1-x% + (-1)-x =( + (-))-x = 0-x = 0, 
so (—1)-x = —x. Also, 
(—x)-y = (—1)-x)-y = (—1)- (xy) = — xy. 
The axioms Al-A4, M1-—M4, and DL do not determine R. In fact there 
is a set consisting of two elements, together with operations of addition and 


multiplication, such that the axioms above are all satisfied: if we denote the 
elements of the set by 0, 1, we can define addition and multiplication by 


0+0=1+1=0, 0O+1=1+4+0=1, 
0-0=1-0=0-1 =0, ll=1. 


There is an additional familiar notion in R, that of positivity, from which 
one can derive the notion of an ordering of R. We axiomatize this by intro- 
ducing a subset P < R, the set of “positive” elements. 


Axioms of order 


Ol. If x eR, then exactly one of the following holds: x ¢ P, x = 0, or 


—xeP. 
O2. If x, ye P, then x + yeP. 
O3. If x, ye P, then xy é€ P. 


It follows from these that if x 4 0, then x? € P. In fact if x € P then this 
follows from O3, while if —x eP, then (— x)? €P, and (—x)? = —(x(—x)) 
—(—x?) = x?. In particular, 1 = 1? eP. 

We definex <yify—xeEP,x>yif y < x. It follows that xe P= 
x > 0. Also, if x < yand y < z, then 


z—-x=(z2-y+(yv-xeP, 
so x < z. In terms of this order, we introduce the Archimedean axiom. 
O4. If x, y > 0, then there is a positive integer n such that nx = x + 
X+++++xXIS > y. 


(One can think of this as saying that, given enough time, one can empty 
a large bathtub with a small spoon.) 
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The axioms given so far still do not determine R; they are all satisfied by 
the subset Q of rational numbers. The following notions will make a distinc- 
tion between these two sets. 

A nonempty subset A < R is said to be bounded above if there is anx eR 
such that every y € A satisfies y < x (as usual, y < x means y < xory = xX). 
Such a number x is called an upper bound for A. Similarly, if there is an 
x € R such that every y € A satisfies x < y, then A is said to be bounded below 
and x is called a lower bound for A. 

A number xe R is said to be a least upper bound for a nonempty set 
A < R if x is an upper bound, and if every other upper bound x’ satisfies 
x’ > x. If such an x exists it is clearly unique, and we write 


x = lubA. 


Similarly, x is a greatest lower bound for A if it is a lower bound and if every 
other lower bound x’ satisfies x’ < x. Such an x is unique, and we write 


x = glb A. 
The final axiom for R is called the completeness axiom. 


OS. If A is a nonempty subset of R which is bounded above, then A has 
a least upper bound. 


Note that if A < Ris bounded below, then theset B = {x | xe R, —x e A} 
is bounded above. If x = lub B, then —x = glb A. Therefore O5 is equiva- 
lent to: a nonempty subset of R which is bounded below has a greatest lower 
bound. 


Theorem 2.1. © does not satisfy the completeness axiom. 


Proof. Recall that there is no rational p/q, p, p € Z, such that (p/q)? = 2: 
in fact if there were, we could reduce to lowest terms and assume either p or q 
is odd. But p? = 2q? is even, so p is even, so p = 2m, me Z. Then 4m? = 2@?, 
so q? = 2m? is even and q is also even, a contradiction. 

Let A = {x|xeQ, x? < 2}. This is nonempty, since 0,1¢A. It is 
bounded above, since x => 2 implies x* > 4, so 2 is an upper bound. We shall 
show that no x € Q is a least upper bound for A. 

If x < 0, then x < 1 € A, so x is not an upper bound. Suppose x > 0 and 
x? < 2. Suppose he Q and 0 <h< 1. Then x+AeEQ andx+h>x. 
Also, (x + h)? = x7 + 2xh + h? < x2 + 2xh +h = x? + (2x + Dh. If we 
choose A > 0 so small that h < 1 andh < (2 — x?)/(2x 4+ 1), then (x + h)? 
<2. Thenx + he A,andx +h > x,so xis not an upper bound of A. 

Finally, suppose x € Q, x > 0,and x? > 2.Supposehe QandO <h < x. 
Then x —heQ and x —h> 0. Also, (x — h)? = x? — 2xh + h? > x? - 
2xh. If we choose h > 0 so small that h < 1 and A < (x? — 2)/2x, then 
(x — h)* > 2. It follows that if ye A, then y < x —h. Thus x — his an 
upper bound for A less than x, and x is not the least upper bound. [J 


We used the non-existence of a square root of 2 in Q to show that O5 does 
not hold. We may turn the argument around to show, using O5, that there is 
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a real number x > 0 such that x? = 2. In fact, let A = {y| ye R, y? < 2}. 
The argument proving Theorem 2.1 proves the following: A is bounded 
above; its least upper bound x is positive; if x? < 2 then x would not be an 
upper bound, while if x? > 2 then x would not be the least upper bound. 
Thus x? = 2. 

Two important questions arise concerning the above axioms. Are the 
axioms consistent, and satisfied by some set R? Is the set of real numbers the 
only set satisfying these axioms? 

The consistency of the axioms and the existence of R can be demonstrated 
(to the satisfaction of most mathematicians) by constructing R, starting with 
the rationals. 

In one sense the axioms do not determine R uniquely. For example, let 
R° be the set of all symbols x°, where x is (the symbol for) a real number. 
Define addition and multiplication of elements of R° by 


x° + y® = (x + y)", xy? = (xy). 


Define P° by x°¢ P°<>xeP. Then R° satisfies the axioms above. This is 
clearly fraudulent: R° is just a copy of R. It can be shown that any set with 
addition, multiplication, and a subset of positive elements, which satisfies all 
the axioms above, is just a copy of R. 

Starting from R we can construct the set C of complex numbers, without 
simply postulating the existence of a “quantity” isuch that i? = —1. Let C° 
be the product set R? = R x R, whose elements are ordered pairs (x, y) of 
real numbers. Define addition and multiplication by 


(x, y) + (x’, y’) = (x + x,y + y)s 
(x, v(x’, y’) = (xx' — yy’, xy’ + x’y). 


It can be shown by straightforward calculations that C° together with these 
operations satisfies Al, A2, Ml, M2, and DL. To verify the remaining 
algebraic axioms, note that 


(x, y) + (0, 0) = (, y). 
(x, y) + (—x, —y) = (0, 0), 
(x, y)(l, 0) = (x, y), 
(x, y(x/(x? + y?), —y/? + y?)) = (1,0) if (x, ») # (0, 0). 


If x € R, let x° denote the element (x, 0) e C°. Let i° denote the element 
(0, 1). Then we have 


(x, ¥) = (x, 0) + (0, y) = (x, 0) + ©, I)(y, 0) = x° + iy?. 


Also, (1°)? = (0, 1)(0, 1) = (—1, 0) = —1°. Thus we can write any element 
of C° uniquely as x° + i°y°, x, ye R, where (i°)? = —1°. We now drop the 
superscripts and write x + iy for x° + i°y® and C for C°: this is legitimate, 
since for elements of R the new operations coincide with the old: x° + y® = 
(x + y)°, x°y® = (xy)°. Often we shall denote elements of C by z or w. When 
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we write z = x + iy, we shall understand that x, y are real. They are called 
the real part and the imaginary part of z, respectively: 


Zz=x+ iy, x = Re(z), y = Im(z). 


There is a very useful operation in C, called complex conjugation, defined 
by: 
z* = (x + iy)* = x — iy. 


Then z* is called the complex conjugate of z. It is readily checked that 


(z + w)* = z* + w*, (zw)* = z*w*, 
(z*)* =z, z¥z = x? + y?, 


Thus z*z 4 0 if z # 0. Define the modulus of z, |z|, by 

|z| = (z*z)i? = (x? + y*)42, 0 z= x+ iy. 
Then if z 4 0, 

1 = z*z|z|~? = z(z*|z|~), 
or 
z~+ = 2*|2|~?, 
Adding and subtracting gives 
z+z* = 2x, z— z* = iy ifz=x+iy. 

Thus 

Re (z) = 4(z + 2*), Im (z) = 4i71(z — 2*). 


The usual geometric representation of C is by a coordinatized plane: 
z = x + iy is represented by the point with coordinates (x, y). Then by the 
Pythagorean theorem, |z| is the distance from (the point representing) z to 
the origin. More generally, |z — w| is the distance from z to w. 


Exercises 


=" 


. There is a unique real number x > 0 such that x? = 2. 
2. Show that Re(z + w) = Re(z) + Re(w), Im(z + w) = Im(z) + Im(). 
3. Suppose z = x + iy, x, yER. Then 


Ix] < lz), lyf slz|, lz] s |] + II. 
4. For any z, weC, 
|zw*| = |z| |wI. 


5. For any z, we C, 


lA 


lz + wl < |z| + [w. 
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(Hint: |z + w|? = (z + w)*(z + w) = [z/? + 2 Re (zw*) + |w/?; apply Ex- 
ercises 3 and 4 to estimate |Re (zw*)|.) 

6. The Archimedean axiom O4 can be deduced from the other axioms for 
the real numbers. (Hint: use O5). 

7. If a > 0 and n is a positive integer, there is a unique b > O such that 
b" = a. 


§3. Sequences of real and complex numbers 


A sequence (Z,)r-, of complex numbers is said to converge to z € C if for 
each « > 0, there is an integer N such that |z, — z| < « whenever n > N. 
Geometrically, this says that for any circle with center z, the numbers z, all 
lie inside the circle, except for possibly finitely many values of n. If this is the 
case we write 


Z,x—>zZ, or limz, =z, or limz, = z. 


na— co 


The number z is called the /imit of the sequence (z,)7_,. Note that the limit is 
unique: suppose z, — z and also z, — w. Given any « > 0, we can take n so 
large that |z, — z| < e and also |z, — w| < e. Then 


lz —w| < |z—2z,| + [Zn — wl < e+e =2e. 


Since this is true for all « > 0, necessarily z = w. 
The following proposition collects some convenient facts about con- 


vergence. 
Proposition 3.1. Suppose (z,)r_-1 and (W,)n=1 are sequences in C. 


(a) z, —z if and only if z, — z—~0. 

(b) Let z, = Xn, + iVny Xn Vn real. Then z,—>z = x + iy if and only if 
Xn —> x and y,—>Y. 

(c) If z,—>z and w, -—> Ww, then z, + w,—>Z+ WwW. 

(d) If z, — z and w,, —> w, then z,w, —> ZW. 

(e) If Zz, —>z # 0, then there is an integer M such that z, # 0ifn => M. 
Moreover (z,,~1)P=y converges to z~, 


Proof. (a) This follows directly from the definition of convergence. 
(b) By Exercise 3 of §3, 
[Xn — X| + [¥n — YS [Zn — 2] S 2|%n — x] + 2] yn — I. 
It follows easily that z, — z—>0 if and only if x, — x ~0O andy, — y>0. 
(c) This follows easily from the inequality 
\(z + Wa) — (2 + w)| = [Zn — 2) + Wa — W)| S [Zn — 2] + [Wa — WI. 
(d) Choose M so large that if n => M, then |z, — z| < 1. Then for 


n>=M, 
[Zn] = \(Zn — z) + 2] < 1 4+ {z]. 
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Let K = 1 + |w| + |z|. Then for alln > M, 

= IZn(Wn _ w) + (Zn ~ z)w| 

< [zp] [wa — wl + 2, — 2] [wl 
< K(|w, — w| + |z, — 2z]). 


[Z,Wn — ZW| 


Since w, — w—>Qand z, — z— 0, it follows that z,w, — zw—>0. 
(e) Take M so large that |z, — z| < 4|z| whenn > M. Then forn = M, 


= | 
= [Zn] + |z — Za] — 3/2] = [Zn + @ — 2n)| — 41z| = 31zl. 
Therefore, z, ~ 0. Also forn > M. 


(z — Z,)z~1z,71| 


Jean? = 274] = | 
s |Z ~ Z,|°|Z|~*-Glz|)~* = K|z ~ Znl; 


where K = 2|z|~?. Since z — z, > 0 we have z,-1 — z~? +0. [ 


A sequence (z,)7~1 in C is said to be bounded if there isan M > Osuch that 
\Zn| < M for all n; in other words, there is a fixed circle around the origin 
which encloses all the z,’s. 

A sequence (x,,)7- 1 in R is said to be increasing if for each n, X, < Xna13 
it is said to be decreasing if for each n, xX, > Xn41.- 


Proposition 3.2. A bounded, increasing sequence in R converges. A bounded, 
decreasing sequence in R converges. 


Proof. Suppose (x,)7-1 is a bounded, increasing sequence. Then the set 
{x, |n = 1,2,...} is bounded above. Let x be its least upper bound. Given 
e > 0, x — eis not an upper bound, so there is an N such that xy > x — «. 
Ifn => N, then 


X—-—€SfXyn SX, SX, 


so |x, — x| < «. Thus x,->x. The proof for a decreasing sequence is 
similar. {J 


If A < Ris bounded above, the least upper bound of A is often called the 
supremum of A, written sup A. Thus 


sup A = lub A. 


Similarly, the greatest lower bound of a set B < R which is bounded below 
is also called the infimum of A, written inf A: 
inf A = glb A. 

Suppose (x,)7-, 1s a bounded sequence of reals. We shall associate with 


this given sequence two other sequences, one increasing and the other 
decreasing. For each n, let A, = {Xn, Xn41, Xn42,---}, and set 


x, = inf A,, x, = sup A,. 
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Now A, > An+1, 80 any lower or upper bound for A, is a lower or upper 
bound for A,4,. Thus 
, , " n” 
Xn s Xn+1) Xn < Xn+1° 


Choose M so that |x,| < M, all n. Then —M is a lower bound and M an 
upper bound for each A,. Thus 


(3.1) —-M<x,<x,<M, all n. 


We may apply Proposition 3.2 to the bounded increasing sequence (x;,)n=1 
and the bounded decreasing sequence (x,)r-1 and conclude that both 
converge. We define 


lim inf x, = lim x, 

lim sup x, = lim x;. 
These numbers are called the Jower limit and the upper limit of the sequence 
(x,) 21, respectively. It follows from (3.1) that 
(3.2) —M < liminf x, < limsupx, < M. 


A sequence (z,)2-1 in C is said to be a Cauchy sequence if for each e > 0 
there is an integer N such that |z, — z,| < e whenever n > N andm > N. 
The following theorem is of fundamental importance. 


Theorem 3.3. A sequence in C (or R) converges if and only if it is a Cauchy 
sequence. 


Proof. Suppose first that z, — z. Given e > 0, we can choose N so that 
|Z, — 2| < 4ifm = N. Then ifn,m = N we have 


\Zn — Zml S |2n — 2| + |Z — 2m| < de + fe =. 


Conversely, suppose (z,)”_-1 is a Cauchy sequence. We consider first the 
case of a real sequence (x,)7-, which is a Cauchy sequence. The sequence 
(x,)2=-1 is bounded: in fact, choose M so that |x, — x,| < lifn,m > M. 
Then ifn > M, 


[xn] < |X. — xXm| + [xml < 1+ |xu]- 


Let K = max {|x,|, |xo|,..., |Xa-al, [xu] + 1}. Then for any n, |x,| < K. 
Now since the sequence is bounded, we can associate the sequences (x;,)r=1 
and (x7)2., as above. Given « > 0, choose N so that |x, — x,| < ¢ if 
n,m > N. Now suppose n = m = N. It follows that 


Xm —~€S Xn S Xm + n=>me2N. 
By definition of x;, we also have, therefore, 
Xm ~€E SX, SS Xn te, n>me2N. 
Letting x = lim inf x, = lim x,, we have 


Xm ~ESXSXm+ 8 m2 N, 
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Or |Xm — x| < e,m > N. Thus x,—x. 

Now consider the case of a complex Cauchy sequence (z,)7_,. Let 
Zn = Xn + Wns Xn» Vn e R. Since |X, — Xm| < [Zn — Zml, (%n)P21 is a Cauchy 
sequence. Therefore x,-—>xeR. Similarly, y,->yeR. By Proposition 
3.1(b), z, >x + iy. J 


The importance of this theorem lies partly in the fact that it gives a cri- 
terion for the existence of a limit in terms of the sequence itself. An im- 
mediately recognizable example is the sequence 


3, 3.1, 3.14, 3.142, 3.1416, 3.14159,..., 


where successive terms are to be computed (in principle) in some specified 
way. This sequence can be shown to be a Cauchy sequence, so we know it has 
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a limit. Knowing this, we are free to give the limit a name, such as “‘7”’. 


We conclude this section with a useful characterization of the upper and 
lower limits of a bounded sequence. 


Proposition 3.4. Suppose (x,)n=1 is a bounded sequence in R. Then lim inf x, 
is the unique number x’ such that 


(i)’ for any e > O, there is an N such that x, > x' — e whenevern > N, 
(ii)’ for any « > 0 and any N, there is ann = N such that x, < x’ + «. 


Similarly, lim sup x, is the unique number x" with the properties 


(i)” for any « > O, there is an N such that x, < x" + « whenevern > N, 
(ii)” for any « > 0 and any N, there is ann > N such that x, > x — «. 


Proof. We shall prove only the assertion about lim inf x,. First, let 
x), = inf {x,, Xn41,...} = Inf A, as above, and let x’ = lim x, = lim inf x,. 
Suppose « > 0. Choose WN so that xy, > x’ — e. Then n = N implies x, = 
Xy > x’ — «, So (i)’ holds. Given « > 0 and N, we have xy < x’ < x’ + te. 
Therefore x’ + $e is not a lower bound for Ay, so there is ann > Nsuch that 
Xn <x’ +4e < x’ + «. Thus (11)’ holds. 

Now suppose x’ is a number satisfying (i)’ and (11)’. From (i)’ it follows 
that inf A, > x’ — « whenever n > N. Thus liminf x, > x’ — e, all e, so 
lim inf x, = x’. From (ii)’ it follows that for any N and any e, inf Ay < 
x’ + «. Thus for any N, inf Ay < x’, so liminf x, < x’. We have lim inf 
x, =x’. O 


Exercises 


1. The sequence (1/n)?_, has limit 0. (Use the Archimedean axiom, §2.) 

2. If x, > 0 and x, +0, then x,2/? — 0. 

3. If a > 0, then a!" + 1 as n—>oo. (Hint: if a > 1, let a?" = 1 4+ x,,. 
By the binomial expansion, or by induction, a = (1 + x,)" < 1 + nx,. Thus 
xX, < n-ia—>0. Ifa < 1, then a?” = (b'")-1 where b = a™1 > 1.) 
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4. lim nt” = 1, (Hint: let n/" = 1+ y,. For n>2,n=(1+y,)" => 
1 + ny, + 4n(n — 1)y,? > 4n(n — l)yn?, SO yn? < 2(n —:1)-+-—+0. Thus 
Yn > 0.) 

5. If ze C and |z| < 1, then z2*7>>0asn—o, 

6. Suppose (x,)7~1 iS a bounded real sequence. Show that x, — x if and 
only if lim inf x, = x = limsup x,. 

7. Prove the second part of Proposition 3.4. 

8. Suppose (x,)r=1 and (a,)?_, are two bounded real sequences such that 
a,—>a> 0. Then 


lim inf a,x, = a-lim inf x,, lim sup a,x, = a-lim sup x,. 


$4. Series 


Suppose (Z,)n=1 1S a sequence in C. We associate to it a second sequence 

(S,)r=1, Where 
n 
S, = > Zn = Zt Zo tee + Ly 
m=1 

If (s,)r-1 converges to s, it is reasonable to consider s as the infinite sum 
>n=1 Zn. Whether (s,)¢-1 converges or not, the formal symbol >°_, z, or > Z, 
is called an infinite series, or simply a series. The number z, is called the nth 
term of the series, s, is called the nth partial sum. If s, —> s we say that the 
series > Z, converges and that its sum is s. This is written 


(4.1) s= S Zn 


(Of course if the sequence is indexed differently, e.g., (z,)7- 5, we make the 
corresponding changes in defining s, and in (4.1).) If the sequence (s,,)°_ , does 
not converge, the series > z, is said to diverge. 

In particular, suppose (x,)7°- 1 1s a real sequence, and suppose each x, > 0. 
Then the sequence (s,)7-, of partial sums is clearly an increasing sequence. 
Fither it is bounded, so (by Proposition 3.2) convergent, or for each M > 0 
there is an N such that 


nr 
S, = > Xm > M whenever n > N. 
m=1 


In the first case we write 


(4.2) S Xn < 
n=1 


and in the second case we write 


(4.3) S Xn = ©. 
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Thus (4.2) <> > x, converges, (4.3) <> > x, diverges. 


Examples 


1. Consider the series 57,071. We claim >°,n~1 = oo. In fact 
(symbolically), 


Dmtaltdcggdghcgegtgy.. 
2E+t+G+aytG+etet+nt: 
=2 +4 + 2G) + 4G) + 8Ge) + --- 
=2+4+34++:-=0 


2. Dn=1Nn~* < o. In fact (symbolically), 

Dn =14+ G24 G2 +--+ GP +e 
1+ 6)? + G+ @? + @? + @?+ @+-- 
1 + 2G)? + 4G)? + 8@)? ++ 
1+h44444---=2. 


ll IA il 


I 


(We leave it to the reader to make the above rigorous by considering the 
respective partial sums.) 


How does one tell whether a series converges ? The question is whether the 
sequence (Sn)n=1 Of partial sums converges. Theorem 2.3 gives a necessary and 
sufficient condition for convergence of this sequence: that it be a Cauchy 
sequence. However this only refines our original question to: how does one 
tell whether a series has a sequence of partial sums which is a Cauchy 
sequence’? The five propositions below give some answers. 


Proposition 4.1. Jf > 7, z, converges, then z, — 0. 


Proof. If > z, converges, then the sequence (s,)7_-, of partial sums is a 
Cauchy sequence, so s, — s,-, > 0. But s, — 5,-, =2Z,. JQ 


Note that the converse is false: 1/n — 0 but > 1/n diverges. 
Proposition 4.2. Jf |z| < 1, then >*_, 2" converges; the sumis(1 — z)7?. 
If |z| = 1, then >. 2" diverges. 
Proof. The nth partial sum is 
Sy = ltzt2%7 +--+. 4+ 2%77, 


Then s,(1 ~ z) = 1— 2", so s, =(1 — z")/(1 — z). If |z| < 1, then as 
n—> oo, z”-> 0 (Exercise 5 of §3). Therefore s, > (1 — z)7?. If |z| = 1, then 
|z"| => 1, and Proposition 4.1 shows divergence. [] 


The series >°°_, z” is called a geometric series. 


Proposition 4.3. (Comparison test). Suppose (Z,)n=1 is a sequence in C 
and (a,)p=1 a Sequence in R with each a, > 0. If there are constants M, N 


such that 
\Z,| < Ma, whenevern => N, 
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and if > a,, converges, then > Z, converges. 


Proof. Let 8, = >r21Zms On = Dr =1a,. If n,m > N then 


n 
[Sn — Sm| = > Zn| < > |z,,| 
j=m+1 j=mt+1 
n 
<M > a, = M(b, — bn). 
j=m+1 


But (0,)7-, is a Cauchy sequence, so this inequality implies that (s,)7_1 is 
also a Cauchy sequence. [ 


Proposition 4.4. (Ratio test). Suppose (z,)r=1 is a sequence in C and 
suppose Z, # 0, all n. 


(a) If 


lim sup |Zn+1/Z,| < 1, 
then > Z, converges. 
(b) If 
lim inf |Z, 43/Z,| > 1, 
then > z,, diverges. 


Proof. (a) In this case, take r so that lim sup |[z,4,/z,| <r < 1. By 
Proposition 3.4, there is an N so that |z,+3/Z,| < r whenever n = N. Thus if 
n>WJN, 


IZnl <r|Zn-i] Srer|Zn-o] S +--+ <r" 7% |zy| = Mr*, 


where M = r~*|z,|. Propositions 4.2 and 4.3 imply convergence. 
(b) In this case, Proposition 3.4 implies that for some N, |Z,41/Z,| = 1 if 
n> N. Thus forn > N. 


[Zn] = |Zn-a| = +++ = [Z| > 0. 
We cannot have z, — 0, so Proposition 4.1 implies divergence. {] 


Corollary 4.5. Jf z, #0 for n= 1,2,... and if lim |z,,,/z,| exists, 
then the series > z, converges if the limit is <1 and diverges if the limit is >1. 


Note that for both the series > 1/n and > 1/n?, the limit in Corollary 4.5 
equals 1. Thus either convergence or divergence is possible in this case. 


Proposition 4.6. (Root test). Suppose (z,)r-1 is a sequence in C. 
(a) If 
lim sup |z,|?" < 1, 


then >. Z, converges. 


(b) if 


lim sup |z,|1/" > 1, 


then > z,, diverges. 
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Proof. (a) In this case, take r so that lim sup |z,|1/" < r < 1. By Propo- 
sition 3.4, there is an N so that |z,|’/" < r whenever n > N. Thus ifn > N, 
then |z,| < r”. Propositions 4.2 and 4.3 imply convergence. 

(b) In this case, Proposition 3.4 implies that |z,|1/" => 1 for infinitely 
many values of n. Thus Proposition 4.1 implies divergence. [] 


Note the tacit assumption in the statement and proof that (|z,|/")?_1 is a 
bounded sequence, so that the upper and lower limits exist. However, if this 
sequence is not bounded, then in particular |z,| > 1 for infinitely many 
values of n, and Proposition 4.1 implies divergence. 


Corollary 4.7. Jf lim |z,|"’" exists, then the series > z, converges if the 
limit is <1 and diverges if the limit is >1. 


Note that for both the series > 1/n and > 1/n?, the limit in Corollary 4.7 
equals 1 (see Exercise 4 of §3). Thus either convergence or divergence is 
possible in this case. 

A particularly important class of series are the power series. If (an)r=o 1S 
a sequence in C and 2) a fixed element of C, then the series 


(4.2) S a(Z — Zo)" 


is the power series around Zz, with coefficients (an)n=0. Here we use the conven- 
tion that w° = 1 for all we C, including w = 0. Thus (4.2) is defined, as a 
series, for each ze C. For z = Zp it converges (with sum ay), but for other 
values of z it may or may not converge. 


Theorem 4.8. Consider the power series (4.2). Define R by 


R=0 if (\a,|*/")?-1 is not a bounded sequence, 
R = (lim sup |a,|"")~? —s if lim sup |a,|*"" > 0, 
R=o — iflimsup fa,|*" = 0. 


Then the power series (4.2) converges if |z — Z| < R, and diverges if 
|Z — Zo| > R. 


Proof. We have 


(4.3) lan(zZ — 2Z)"|4" = |a,|*/"|z — Zo]. 


Suppose z # Zp. If (a,|1/")7_-, is not a bounded sequence, then neither is 
(4.3), and we have divergence. Otherwise the conclusions follow from (4.3) 
and the root test, Proposition 4.6. {J 


The number R defined in the statement of Theorem 4.8 is called the radius 
of convergence of the power series (4.2). It is the radius of the largest circle in 
the complex plane inside which (4.2) converges. 

Theorem 4.8 is quite satisfying from a theoretical point of view: the 
radius of convergence is shown to exist and is (in principle) determined in all 
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cases. However, recognizing lim sup |a,|?/" may be very difficult in practice. 
The following is often helpful. 


Theorem 4.9. Suppose a, # Oforn = N, and suppose lim |ay, 1/a,| exists. 
Then the radius of convergence R of the power series (4.2) is given by 


R = (lim |a,,,/a,|)~* if lim |an+1/a,| > 0. 
R=0o0 if lim |a,41/a,| = 0. 


Proof. Apply Corollary 4.5 to the series (4.2), noting that if z # zp) then 


|Qn41(Z _ Zo)"**/a,(z _ Zo)"| — [an +1/An|°|z ~ Zo|. U 


Exercises 


1. If 5_, Zz, converges with sum s and >°_, w, converges with sum f, 
then >°_, (z, + w,) converges with sum s + f¢. 

2. Suppose > a, and > b, each have all non-negative terms. If there are 
constants M > Oand N such that b, => Ma, whenever n > N, and if > a, = 
oo, then > b, = ©. 

3. Show that >°_, (n + 1)/(2n? + 1) diverges and >°_, (nm + 1)/(2n? + 1) 
converges. (Hint: use Proposition 4.3 and Exercise 2, and compare these to 
> I/n, > 1/n?.) 

4, (2*-Test). Suppose a; => ag =>-+:: > a, = O, all n. Then >, a, < 
00 <> SP, 2*age < 00. (Hint: use the methods used to show divergence of 
> 1/n and convergence of > 1/n?.) 

5. (Integral Test). Suppose a, > a, >---2>a, =0, all n. Suppose 
f: [1, 00) — R is a continuous function such that f() = a,, alln, and f(y) < 
f(x) if y = x. Then SP, a, < 0 J.” f(x) dx < a. 

6. Suppose p > 0. The series >°_, n~? converges if p > | and diverges 
if p < 1. (Use Exercise 4 or Exercise 5.) 

7. The series 7.2. n~*(log n)~? converges; the series >°_. n~ (log n)~} 
diverges. 

8. The series >.) 2"/n! converges for any ze C. (Here 0! = I,n! = 
n(n — 1)\(n — 2)-... +1.) 

9. Determine the radius of convergence of 


> 2"z"/n, > n2z"/n}, > ni! 2", 
n=0 n=1 n=0 
> n!z"/(2n)! 
n=Q 
10. (Alternating series). Suppose |x,| = |x2| =>--- = |x,|, all n, x, = 0 
ifn odd, x, < Oifneven, and x, > 0. Then > x, converges. (Hint: the partial 


sums satisfy 5s. < 54 < 5g <--- < S53 S< $3 < Sj.) 
ll. >%-1 (—1)"/n converges. 
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§5. Metric spaces 


A metric on a set S is a function d from the product set S x Sto R, with 
the properties 


D1. d(x, x) = 0, d(x, y) > 0, if x, ye S,x # y. 
D2. d(x, y) = d(y, x), all x, ye S. 
D3. d(x, z) < d(x, y) + d(y, z), all x, y,zeES. 


We shall refer to d(x, y) as the distance from x to y. A metric space isa set S 
together with a given metric d. The inequality D3 is called the triangle 
inequality. The elements of S are often called points. 

As an example, take S = R? = R x R, with 


(5.1) d((x, y), (x, y’)) = [x — x’)? + (y — yf P?. 


If we coordinatize the Euclidean plane in the usual way, and if (x, y), (x’, y’) 
are the coordinates of points P and P’ respectively, then (5.1) gives the length 
of the line segment PP’ (Pythagorean theorem). In this example, D3 is the 
analytic expression of the fact that the length of one side of a triangle is at 
most the sum of the lengths of the other two sides. The same example in 
different guise is obtained by letting S = C and taking 


(5.2) d(z, w) = |z — w| 


as the metric. Then D3 is a consequence of Exercise 5 in §2. 
Some other possible metrics on R? are: 


d(x, y), (x', y')) = |x — x'| + ly — y’I, 
do((x, y), (x’, y’)) = max {|x — x'|, |y — y']}, 
d3((x, y), (x’, y’)) = 0 if (x, y) = (’, y’), and 1 otherwise. 


Verification that the functions d,, d,, and d; satisfy the conditions D1, D2, D3 
is left as an exercise. Note that d,; works for any set S: if x, ye S we set 
d(x,y) = 1lifx #4 yandOifx = y. 

A still simpler example of a metric space is R, with distance function d 
given by 


(5.3) d(x, y) = |x — y|. 


Again this coincides with the usual notion of the distance between two points 
on the (coordinatized) line. 

Another important example is R", the space of ordered n-tuples x = 
(x1, X2,..., X,) of elements of R. There are various possible metrics on R® 
like the metrics d,, d., d,; defined above for R", but we shall consider here 
only the generalization of the Euclidean distance in R? and R®*. If x = 


(X1, 2 Xn) and y= (V1, V29+> oo) we set 
(5.4) d(x, y) = [(%1 — yi)? + (X2 — yo)? +-++ + Xn — yn)? ]?. 


When nv = 1 we obtain R with the metric (5.3); when n = 2 we obtain R? 
with the metric (5.1), in somewhat different notation. It is easy to verify that 
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d given by (5.4) satisfies D1 and D2, but condition D3 is not so easy to verify. 
For now we shall simply assert that d satisfies D3; a proof will be given ina 
more general setting in Chapter 4. 

Often when the metric d is understood, one refers to a set S alone as a 
metric space. For example, when we refer to R, C, or R” as a metric space with 
no indication what metric is taken, we mean the metric to be given by (5.3), 
(5.2), or (5.4) respectively. 

Suppose (S, d) is a metric space and T is a subset of S. We can consider 
T as a metric space by taking the distance function on T x T to be the 
restriction of dto T x T. 

The concept of metric space has been introduced to provide a uniform 
treatment of such notions as distance, convergence, and limit which occur in 
many contexts in analysis. Later we shall encounter metric spaces much more 
exotic than R” and C. 

Suppose (S, d) is a metric space, x is a point of S, and r is a positive real 
number. The ball of radius r about x is defined to be the subset of S consisting 
of all points in S whose distance from x is less than r: 


B(x) = {y |y¢ S, d(x, y) < r}. 
Clearly x € B,(x). If 0 < r < s, then B,(x) < B,(x). 


Examples 


When S = R (metric understood), B,(x) is the open interval (x — r, x +r). 
When S = R? or C, B,(z) is the open disc of radius r centered at z. Here we 
take the adjective ‘“‘open’’ as understood; we shall see that the interval and 
the disc in question are also open in the sense defined below. 


A subset A © S is said to be a neighborhood of the point x € S if A con- 
tains B(x) for some r > 0. Roughly speaking, this says that A contains all 
points sufficiently close to x. In particular, if A is a neighborhood of x it 
contains x itself. 

A subset A © S is said to be open if it is a neighborhood of each of its 
points. Note that the empty set is an open subset of S: since it has no points 
(elements), it is a neighborhood of each one it has. 


Example 


Consider the interval A = (0, 1] < R. This is a neighborhood of each of 
its points except x = 1. In fact, if 0 < x < 1, let r = min {x, 1 — x}. Then 
A> B(x) = (x — r, x + r). However, for any r > 0, B,(1) contains 1 + 4r, 
which is not in A. 


We collect some useful facts about open sets in the following proposition. 
Proposition 5.1. Suppose (S, d) is a metric space. 


(a) For any xe S and anyr > 0, B,(x) is open. 
(b) If Ay, Ag,..., Am are open subsets of S, then ( \f,=1 Am is also open. 
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(c) If (Ag)sez is any collection of open subsets of S, then gen Ag is also 
open. 


Proof. (a) Suppose ye€ B,(x). We want to show that for some s > 0, 
By) < B(x). The triangle inequality makes this easy, for we can choose 
s=r-— d(y, x). (Since y € B(x), sis positive.) If ze By), then 


d(z,x) < d(z,y) + d(y, x) < s+ d(y,x) =r. 


Thus z € B,(x). 

(b) Suppose x € ()\%-, A,,. Since each A,, is open, there is r(m) > Oso that 
Bram (Xx) © Am. Let r = min {r(1), r(2),...,r(m)}. Then r > 0 and B,(x) < 
Brym (xX) < Am, $0 B(x) © (\h=1 Am. (Why is it necessary here to assume that 
A,, Ag,... 18 a finite collection of sets ?) 

(c) Suppose x € A = Ugeg Az. Then for some particular 8, x € A,. Since 
A, is open, there is anr > 0 so that B(x) < A, < A. Thus Ais open. [ 


Again suppose (S, d) is a metric space and suppose A ¢ S. A point xe S$ 
is said to be a limit point of A if for every r > 0 there is a point of A with 
distance from x less than r: 


B(x) NA # @ ifr > 0. 


In particular, if x € A then x is a limit point of A. The set A is said to be 
closed if it contains each of its limit points. Note that the empty set is closed, 
since it has no limit points. 


Example 


The interval (0, 1] < R has as its set of limit points the closed interval 
(0, 1]. In fact if 0 < x < 1, then x is certainly a limit point. If x = O and 
r > 0, then B,(0) 0 (0, 1] = (-r,rn)N (0, 1] 4 @.Ifx < Oandr = |x|, then 
BAx) O (0, 1] = ©, while if x > 1 andr = x — 1, then B(x) N (0, 1] = @. 
Thus the interval (0, 1] is neither open nor closed. The exact relationship 
between open sets and closed sets 1s given in Proposition 5.3 below. 


The following is the analogue for closed sets of Proposition 5.1. 
Proposition 5.2. Suppose (S, d) is a metric space. 


(a) Foranyx e€ Sandanyr > 0, the closed ball C = {y | ye S, d(x, y) < r} 
is a closed set. 

(b) If Ay, Ag, ..., An are closed subsets of S, then \Ji,=1 Am is closed. 

(c) If(Ag)pgen is any collection of closed subsets of S, then (\geg Ag is closed. 


Proof. (a) Suppose z is a limit point of the set C. Given « > 0, there is 
a point ye B,(z) A C. Then 


d(z,x) < d(z,y) + d(y,x) <e+Qr. 


Since this is true for every « > 0, we must have d(z, x) < r. Thus zeEC. 
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(b) Suppose x ¢ A = (J? _, A, For each m, x is not a limit point of A,,, 
so there is r(m) > 0 such that B,(x) A A, = @. Let 


r = min {r(1), r(2),..., r(m)}. 


Then B(x) 0 A, = 2, all m, so B(x) 0 A = ©. Thus x is not a limit point 
of A. 

(c) Suppose x is a limit point of A = ()ge, Ag. For any r > 0, B(x) A 
A # @. But A © Az, so B(x) ON Az # @. Thus x is a limit point of Ag, so it 
is in Ag. This is true for each 8, soxe A. [ 


Proposition 5.3. Suppose (S, d) is a metric space. A subset A < S is open 
if and only if its complement is closed. 


Proof. Let B be the complement of A. Suppose B is closed, and suppose 
xe A. Then x is not a limit point of B, so for some r > 0 we have B(x) N 
B= @. Thus B,(x) < A, and A is a neighborhood of x. 

Conversely, suppose A is open and suppose x ¢ B. Then xe A, so for 
some r > 0 we have B(x) < A. Then B(x) A B = @, and x is not a limit 
point of B. It follows that every limit point of Bisin B. Qj 


The set of limit points of a subset A < S is called the closure of A; we 
shall denote it by A~. We have A < A™ and A is closed if and onlyif A = A-. 
In the example above, we saw that the closure of (0, 1] < R is [0, 1]. 

Suppose A, B are subsets of S and A < B. We say that A is dense in B if 
Bc A~.In particular, A is dense in S if A~ = S. As an example, Q (the 
rationals) is dense in R. In fact, suppose x € R and r > 0. Choose a positive 
integer 1 so large that 1/n < r. There is a unique integer m so that m/n < 
x < (m+ 1)/n. Then d(x, m/n) = x — m[n < (m + 1)/n — m[n = I[n < 1, 
so m/né B(x). Thus xe Q-. 

A sequence (x,)7_, in S is said to converge to x € Sif for each e > O there 
is an N so that d(x,, x) < «ifn => N. The point x is called the limit of the 
Sequence, and we write 

lim x, =x Or©rx,—>Xx. 
When S = R or C (with the usual metric), this coincides with the definition 
in §3. Again the limit, if any, is unique. 

A sequence (x,);=1 in S is said to be a Cauchy sequence if for each « > 0 
there is an N so that d(x,, Xm) < eifn,m = N. Again when S = Ror C, this 
coincides with the definition in §3. 

The metric space (S, d) is said to be complete if every Cauchy sequence in 
S converges to a point of S. As an example, Theorem 3.3 says precisely that 
RR and C are complete metric spaces with respect to the usual metrics. 

Many processes in analysis produce sequences of numbers, functions, 
etc., in various metric spaces. It is important to know when such sequences 
converge. Knowing that the metric space in question is complete is a powerful 
tool, since the condition that the sequence be a Cauchy sequence is then a 
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necessary and sufficient condition for convergence. We have already seen this 
in our discussion of series, for example. 


Note that R” is complete. To see this note that in R’, 
max {|x; — y,|,7 = 1,...,n} < d(x, y) < n-max {|x,;—y,|,7 = 1,..., n}. 


It follows that a sequence of points in R” converges if and only if each of the 
n corresponding sequences of coordinates converges in R. Similarly, a se- 
quence of points in R” is a Cauchy sequence if and only if each of the n 
corresponding sequences of coordinates is a Cauchy sequence in R. Thus 
completeness of R” follows from completeness of R. (This is simply a general- 
ization of the argument showing C is complete.) 


Exercises 


1. If (S, d) is a metric space, x € S, and r => 0, then 
{y|yeS,d(y, x) > r}_ 


is an open subset of S. 

2. The point x is a limit point of a set A < S if and only if there is a 
sequence (x,);-1 in A such that x, > x. 

3. If a sequence (x,)r-1 in a metric space converges to x € S and also 
converges to ye S, then x = jy. 

4. If a sequence converges, then it is a Cauchy sequence. 

5. If (S, d) is a complete metric space and A < Sis closed, then (A, d) is 
complete. Conversely, if B < S and (B, d) is complete, then B is a closed 
subset of S. 

6. The interval (0, 1) is open as a subset of R, but not as a subset of C. 

7. Let S = Q (the rational numbers) and let d(x, y) = |x — y|, x, ye Q. 
Show that (S, d) is not complete. 

8. The set of all elements x = (x1, X2,..., X,) in R® such that each x; 1s 
rational is a dense subset of R”. 

9. Verify that R” is complete. 


§6. Compact Sets 


Suppose that (S, d) is a metric space, and suppose A is a subset of S. The 
subset A is said to be compact if it has the following property: suppose that 
for each x € A there is given a neighborhood of x, denoted N(x); then there 
are finitely many points x,, X2,..., X, in A such that A is contained in the 
union of N(x,), N(%2),..., N(x,). (Note that we are saying that this is true 
for any choice of neighborhoods of points of A, though the selection of 
points x,,X2,... may depend on the selection of neighborhoods.) It is 
obvious that any finite subset A 1s compact. 
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Examples 


1. The infinite interval (0,00) < R is not compact. For example, let 
N(x) = (x — 1,x + 1), x€(0, 00). Clearly no finite collection of these 
intervals of finite length can cover all of (0, 00). 

2. Even the finite interval (0, 1] < R is not compact. To see this, let 
N(x) = Gx, 2), x € (0, 1]. For any x,, Xo,..., x, € (0, 1], the union of the 
intervals N(x;) will not contain y if y < 4 min {x,, x2,..., X;}. 

3. The set A = {0} U {1, 4, 4, 4,...} © R is compact. In fact, suppose 
for each x € A we are given a neighborhood N(x). In particular, the neigh- 
borhood N(0) of 0 contains an interval (—«, «). Let M be a positive integer 
larger than 1/e. Then 1/n e N(0) for n = M, and it follows that 4 < N(0) U 
N(I)UNG)U---U N/M). 

The first two examples illustrate general requirements which compact sets 
must satisfy. A subset A of S, when (S, d) is a metric space, is said to be 
bounded if there is a ball B,(x) containing A. 


Proposition 6.1. Suppose (S,d) is a metric space, S # @, and suppose 
A < S is compact. Then A is closed and bounded. 


Proof. Suppose y ¢ A. We want to show that y is not a limit point of A. 
For any x € A, let N(x) be the ball of radius 4d(x, y) around x. By assump- 
tion, there are x,, X2,...,X,¢€A such that A < (J2_, N(x,,). Let r be the 
minimum of the numbers 4d(x,, y),..., 4d(x,, y). If x € A, then for some m, 
d(X, Xm) < 4d(Xm, y). But then 


A(Xm, ¥) < A(Xm, x) + A(x, y) 
< 3d(Xm, y) + d(x, y). 


so d(x, y) > 4d(Xm, y) = r. Thus By) A = 2, and y is not a limit point 
of A. 

Next, we want to show that A is bounded. For each x € A, let N(x) be the 
ball of radius 1 around x. Again, by assumption there are x;, Xo,...,X,€A 
such that A ¢ (J? _, N(x,,). Let 


r= ] + max {d(x1, Xa)s d(x1, Xa); re) d(x, Xn)}- 
If ye A then for some m, d(y, Xm) < 1. Therefore d(y, x) < d(y, Xn) + 
A(Xms X1) < 1 + d(Xm, x1) < r, and we have A < B,(x,). OD 


The converse of Proposition 6.1, that a closed, bounded subset of a metric 
space is compact, is not true in general. It is a subtle but extremely important 
fact that it is true in R", however. 


Theorem 6.4. (Heine-Borel Theorem). A subset of R" or of C is compact 
if and only if it is closed and bounded. 


Proof. We have seen that in any metric space, if A is compact it is 
necessarily closed and bounded. Conversely, suppose A < R” is closed and 
bounded. Let us assume at first that = 1. Since A is bounded, it is contained 
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in some closed interval [a, b]. Suppose for each x € A, we are given a neigh- 
borhood N(x) of x. We shall say that a closed subinterval of [a, b] is nice if 
there are points x1, X%2,..., Xm €A such that ()7_, N(x,;) contains the inter- 
section of the subinterval with A; we are trying to show that [a, b] itself is 
nice. Suppose it is not. Consider the two subintervals [a, c] and [c, b], where 
c = $(a + b) is the midpoint of [a, b]. If both of these were nice, it would 
follow that [a, b] itself is nice. Therefore we must have one of them not nice; 
denote its endpoints by a, b,, and let c, = 4(a, + b,). Again, one of the 
intervals [a,, c,] and [c,, b;] must not be nice; denote it by [a2, b.]. Continu- 
ing in this way we get a sequence of intervals [a,,, b,,], m = 0,1, 2,... such 
that [ao, bo] = [a, b], each [a,, b,,] is the left or right half of the interval 
[4m —15 Bm], and each interval [a,,, b,,] is not nice. It follows that a) < a, < 
=) Sdn S by < +++ < by < by and b, — a, = 27>™(bo — ay) +0. There- 
fore there is a point x such that a,, > x and b,, > x. Moreover, a, < x < bn, 
for all m. We claim that x € A; it is here that we use the assumption that A is 
Closed. Since [a,, 5,] is not nice, it must contain points of A: otherwise 
AQ [am, bn] = 2 would be contained in any 3.1, N(x,). Let 


Xm © [Ans Om] O A. 


Clearly x, — x, since a, — x and b,,-—> x. Since A is closed, we get x € A. 
Now consider the neighborhood N(x). This contains an interval (x — e, 
x + «). If we choose m so large that b,, — ad, < e, then since a, < x < by 
this implies [a,,, 5,,] < N(x). But this means that [a,,, b,,] is nice. This contra- 
diction proves the theorem for the case n = 1. 

The same method of proof works in R", where instead of intervals we use 
squares, cubes, or their higher dimensional analogues. For example, when 
n = 2 we choose M so large that A is contained in the square with corners 
(+M, +M). If this square were not nice, the same would be true of one of 
the four equal squares into which it can be divided, and so on. Continuing we 
get a sequences of squares Sy > S, > S, > ---, each of side 4 the length of 
the preceding, each intersecting A, and each not nice. The intersection 
(\m=o0 Sm contains a single point x, and x is in A. Then N(x) contains S,, for 
large m, a contradiction. Since as metric space C = R?, this also proves the 
result for C. { 


Suppose (x,)7-1 1S a sequence in a set S. A subsequence of this sequence is 
a sequence of the form (y,)-1, where for each k there is a positive integer n,, 
so that 

Ny < Ng <0 < Me < Mga <0, 
Vi = Xn, 

Thus, (y,)f=1 1S just a selection of some (possibly all) of the x,’s, taken in 
order. As an example, if (x,)?., © R has x, = (—1)"/n, and if we take 
n, = 2k, then (x,)n=1 = (—1, 3, —3, 4; 4 ...)and (Ve)e=1 — (2; 45 o - .). 
As a second example, let (x,,)., be an enumeration of the rationals. Then for 
any real number x, there is a subsequence of (x,)¢-1 which converges to x. 
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Suppose (8, d) is a metric space. A set A < S is said to be sequentially 
compact if, given any sequence (x,);-1; © A, some subsequence converges 
to a point of A. 


Examples 


1. Any finite set is sequentially compact. (Prove this.) 

2. The interval (0,00) < R is not sequentially compact; in fact let 
xX, = n. No subsequence of (x,)7°_ 1 converges. 

3. The bounded interval (0, 1] < R is not sequentially compact; in fact 
let x, = 1/n. Any subsequence of (x,)r-1 converges to 0, which is not in 


(0, 1]. 
Proposition 6.3. Suppose (S, d) is a metric space, S # @, and suppose 
A © S is sequentially compact. Then A is closed and bounded. 


Proof. Suppose x is a limit point of A. Choose x, € B,),(x) A A, 
n= 1,2,3,.... Any subsequence of (x,)¢-1 converges to x, since x, > x. It 
follows (since by assumption some subsequence converges to a point of A) 
that x € A. Thus A is closed. 

Suppose A were not bounded. Take xe S and choose x, € A such that 
x € B,(x). Let r; = d(x, x) + 1. By the triangle inequality, B,(x,) © B,,(x). 
Since A is not bounded, there is x2¢A such that x. ¢ B,,(x). Thus also 
d(X1, X2) => 1. Let r. = max {d(x, x,), d(x, x2)} + 1 and choose x3; € A such 
that x; ¢ B,,(x). Then d(x;, x3) => 1 and d(x2, x3) = 1. Continuing in this 
way we can find a sequence (x,)¢_-, < A such that d(x, x,) = lifm #n. 
Then no subsequence of this sequence can converge, and A is not sequentially 
compact. [ 


Theorem 6.4. (Bolzano-Weierstrass Theorem). A subset A of R® or of C 
is sequentially compact if and only if it is closed and bounded. 


Proof. We have shown that A sequentially compact implies A closed and 
bounded. Suppose A is closed and bounded, and suppose first that n = 1. 
Take an interval [a, b] containing A. Let c = 4(a + 5). One (or both) of the 
subintervals [a, c] and [c, 5] must contain x, for infinitely many integers n; 
denote such a subinterval by [a,, b,], and consider [a,, c,], [c,, b,] where 
c, = $(a, + 5,). Proceeding in this way we can find intervals [a,,, b,,] with 
the properties [do, bo] = [a,b], [an bn] © [Qn-1, bm-1],. On — Am = 
2~™(bo — ao), and [a,,, b,,] contains x, for infinitely many values of n. Then 
there is a point x such that a, — x, b,,—> x. We choose integers n,, No,... 
so that x,, € (a), b,], ng > mn, and x,, € [de, be], nz > ng and x,, € [az, bg], etc. 
Then this subsequence converges to x. Since A is closed, x € A. 

The generalization of this proof to higher dimensions now follows as in 
the proof of Theorem 6.2. [ 


Both the terminology and the facts proved suggest a close relationship 
between compactness and sequential compactness. This relationship is made 
precise in the exercises below. 
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Exercises 


1. Suppose (x,)-1 iS a Sequence in a metric space (S, d) which converges 
toxe S. Let A = {x} U {x,}”_,. Then A is compact and sequentially compact. 

2. Let Q, the rationals, have the usual metric. Let A = {x | x e Q, x? < 2}. 
Then A is bounded, and is closed as a subset of Q, but is not compact. 

3. Suppose A is a compact subset of a metric space (S, d). Then A is 
sequentially compact. (Hint: otherwise there is a sequence (x,),-1 1n A with 
no subsequence converging to a point of A. It follows that for each xe A. 
there is an r(x) > 0 such that the ball N(x) = B,,,.(x) contains x, for only 
finitely many values of n. Since A is compact, this would imply that 
{1, 2, 3,...} is finite, a contradiction.) 

4. A metric space is said to be separable if there is a dense subset which is 
countable. If (S, d) is separable and A © S is sequentially compact, then A is 
compact. (Hint: suppose for each x € A we are given a neighborhood N(x). 
Let {x1, X2, X3,...} be a dense subset of S. For each x € A we can choose an 
integer m and a rational r,, such that x € B, (Xm) < N(x). The collection of 
balls B, (Xm) so obtained is (finite or) countable; enumerate them as C;, 
C.,.... Since each C; is contained in some N(x), it 1s sufficient to show that 
for some n, J?.1, C; > A. If this were not the case, we could take y, € A, 
yn €\UR-1 C;, n= 1,2,.... Applying the assumption of sequential com- 
pactness to this sequence and noting how the C, were obtained, we get a 
contradiction.) 
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A vector space over R is a set X in which there are an operation of addi- 
tion and an operation of multiplication by real numbers which satisfy certain 
conditions. These abstract from the well-known operations with directed line 
segments in Euclidean 3-space. 

Specifically, we assume that there is a function from X x X to X, called 
addition, which assigns to the ordered pair (x, y)—_ X x X an element of X 
denoted x + y. We assume 


Vil. (x +y)4+z2=x+(y + 2), all x, y, ze X. 
V2.x+y=ytx,allx,yeX. 

V3. There exists 0 € X such that x + 0 = x, all X. 

V4. For all x e X, there exists —x € X such that x + (—x) = 0. 


We assume also that there is a function from R x X to X, called scalar 
multiplication, assigning to the ordered pair (a, x) ¢ R x X an element of X 
denoted ax. We assume 


V5. (ab)x = a(bx), alla,be R,x eX. 

V6. a(x + y) = ax + ay, allae R, x, yeX. 
V7. (a + b)x = ax + bx, alla, be R,xeX. 
V8. Ix = x, allx eX. 
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Summarizing: a vector space over R, or a real vector space, is a set X with 
addition satisfying V1-V4 and scalar multiplication satisfying V5~V8. The 
elements of X are called vectors and the elements of R, in this context, are 
often called scalars. 

Similarly, a vector space over C, or a complex vector space, is a set X 
together with addition sstisfying V1I-V4 and scalar multiplication defined 
from C x X to X and satisfying V5-V8. Here the scalars are, of course, 
complex numbers. 


Examples 


1. R is a vector space over R, with addition as usual and the usual 
multiplication as scalar multiplication. 

2. The set with one element 0 is a vector space over R or C with 0 + 0 = 
0, a0 = 0, all a. 

3.IR" is a vector space of R if we take addition and scalar multiplication as 


(x1, XQ,-- +5 Xn) + (V1 Yar - ° > Yn) = (x, + Vi» X2 + Y2>- 29 Xn + Yn)s 
A(X, Xq5. + +5 Xn) = (AX1, AXg,..., AXn). 


4. C is a vector space over R or C with the usual addition and scalar 
multiplication. 

5. Let S be any set, and let F(S; R) be the set whose elements s are the 
functions from S to R. Define addition and scalar multiplication in F(S; R) by 


(f+ g)(s) = f(s) + gs), os eS, 
(af)(s) = af(s), se S,aeR. 


Then F(S; R) is a vector space over R. 

6. The set F(S; C) of functions from S to C can be made a complex 
vector space by defining addition and scalar multiplication as in 5. 

7. Let X be the set of all functions f: R —-> R which are polynomials, i.e., 
for some dp, @1,..-,4,ER, 


F(X) = Ao + Ay + Ggx* +-++ + a,x", allxeR. 


With addition and scalar multiplication defined as in 5, this is a real vector 
space. 

8. The set of polynomials with complex coefficients can be considered as 
a complex vector space. 


Let us note two elementary facts valid in every vector space: the element 
Q of assumption V3 is unique, and for any xe X, Ox = 0. First, suppose 
0’ — X has the property that x + 0’ = x for each x € X. Then in particular 
0 =0' +0=0+ 0 = 0 (using V2 and V3). Next, if x e X, then 


Ox = Ox + 0 = Ox + [0x + (—0x)] 
= [Ox + Ox] + (—0Ox) = (0 + 0)x + (—Ox) 
= Ox + (—0x) = 0. 
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Note also that the element —x in V4 is unique. In fact if x + y = 0, then 


y=y+O=y+e+ [x+ (-x)] = ly + x] + (-x) 
= [x + y] + (-x) = 0 + (—x) = (-x) + 0 = —-x. 


This implies that (—1)x = —x, since 
x + (-1)x = [1 + (-])]x = 0x = 0. 


A non-empty subset Y of a (complex) vector space X is called a subspace 
of X if it is closed with respect to the operations of addition and scalar multi- 
plication. This means that if x, ye Y and aeéC, then x + y and ay are in Y. 
If so, then Y itself is a vector space over C, with the operations inherited 
from X. 


Examples 


1. Any vector space is a subspace of itself. 

2. The set {0} is a subspace. 

3. {x | x, = 0} is a subspace of R”. 

4. In the previous set of examples, the space X in example 7 1s a subspace 
of F(R; R). 

5. Let X again be the space of polynomials with real coefficients. For each 
n= 0,1, 2,..., let X,, be the subset of X consisting of polynomials of degree 
<n. Then each X,, is a subspace of X. For m < n, X,, is a subspace of X,,. 


Suppose x,, Xo,...,X, are elements of the vector space X. A linear 
combination of these vectors is any vector x of the form 


KX = QyX, + ApXg +°-+ + AyXn, 
where 41, d2,..-., @, are scalars. 


Proposition 7.1. Let S be a nonempty subset of the vector space X, and let 
Y be the set of all linear combinations of elements of S. Then Y is a subspace of 
X andY> S. 

If Z is any other subspace of X which contains the set S, then Z > Y. 


Proof. If x,y¢Y, then by definition they can be expressed as finite 
sums X = > a,x; y = > b,y;, where each x,eS and each y,¢S. Then 
ax = > (aa,)x; is a linear combination of the x,’s, and x + y is a linear 
combination of the x,’s and the y,’s. If x eS, then x = Ixe Y. Thus Y is a 
subspace containing S. 

Suppose Z is another subspace of X containing S. Suppose x € Y. Then 
for some Xj, Xo,..., X, € S, X = > a,x;. Since Z is a subspace and Xj, Xo,..., 
x, €Z, we have a Xj, doX2,...,@,X,€Z. Moreover, a,x; + aoX,€Z, So 
(a,X,; + AeX2) + a3X3 € Z. Continuing we eventually find thatxe Z. [ 


We can paraphrase Proposition 7.1 by saying that any subset S of a 
vector space X is contained in a unique smallest subspace Y. This subspace is 
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called the span of S. We write Y = span (S). The set S is said to span Y. Note 
that if S is empty, the span is the subspace {0}. 


Examples 


Let X be the space of all polynomials with real coefficients. Let f, be the 
polynomial defined by f,(x) = x”. Then span {fo, fi,...,/,} is the subspace 
X,, of polynomials of degree < zx. 


A linear combination a,x; + @oXo +--- + a,x, Of the vectors x,, Xo,..., 
X,, iS said to be nontrivial if at least one of the coefficients a, do,..., a, is not 
zero. The vectors X;, Xg,..., X, are said to be linearly dependent if some non- 
trivial linear combination of them is the zero vector. Otherwise they are said 
to be linearly independent. More generally, an arbitrary (possibly infinite) 
subset S is said to be linearly dependent if some nontrivial linear combination 
of finitely many distinct elements of S is the zero vector; otherwise S is said 
to be linearly independent. (Note that with this definition, the empty set is 
linearly independent.) 


Lemma 7.2. Vectors X,, Xo,..., X, in X,n = 2, are linearly dependent if 
and only if some x; is a linear combination of the others. 


Proof. If xX,,X2g,...,X, are linearly dependent, there are scalars ay, 
Az,...,@,, not all 0, such that > a,x, = 0. Renumbering, we may suppose 
a, # 0. Then x, = >%.. (—a, 7 1a,)x;. 

Conversely, suppose x,, say, is a linear combination >7.. 5,x;. Letting 
a, = 1, anda; = —b,; for j => 2, we have Dajx,;=0. Q 


The vector space X is said to be finite dimensional if there is a finite subset 
which spans X. Otherwise, X is said to be infinite dimensional. A basis of a 
(finite-dimensional) space X is an ordered finite subset (x,, X2,..., X,) Which 
is linearly independent and spans X. 


Examples 


1. R” has basis vectors (€:,€2,...,€,), Where e, = (1,0,0,..., 0), 
e. = (0, 1,0,...,0),...,e, = (0,0,...,0, 1). This is called the standard 
basis in R*. 

2. The set consisting of the single vector 1 is a basis for C as a complex 
vector space, but not as a real vector space. The set (1, i) is a basis for C as a 
real vector space, but is linearly dependent if C is considered as a complex 
vector space. 


Theorem 7.3. A finite-dimensional vector space X has a basis. Any two 
bases of X have the same number of elements. 


Proof. Let {X;, X2,...,X,} span X. If these vectors are linearly inde- 
pendent then we may order this set in any way and have a basis. Otherwise 
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ifn > 2 we may use Lemma 7.2 and renumber, so that x, is a linear combina- 
tion >?2} a;x,;. Since span {x,, Xo,..., X,} = X, any x eX is a linear com- 
bination 


n 


n-1 n~-1 
x= > bx; = 2, bx; + b( 5, ax, 


=1 
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1 
(0, + b,4;)X;. 
j=l 
Thus span {x;, Xo,...,X,-i} = X. If these vectors are not linearly inde- 
pendent, we may renumber and argue as before to show that 


span {X,, X2,..., X,-2} = X. 


Eventually we reach a linearly independent subset which spans X, and thus 
get a basis, or else we reach a linearly dependent set {x,} spanning X and con- 
sisting of one element. This implies x; = 0, so X = {0}, and the empty set is 
the basis. 

Now suppose (X;, X2,...,X,) and (yi, Yo,..., Ym) are bases of X, and 
suppose m <n. Ifn = 0, then m = 0. Otherwise x, 4 0. The y,’s span X, so 
X, = > ay,;. Renumbering, we may assume a, # 0. Then 


Yi = 47 *X, — 2, a,~*ay;. 
Thus y, is a linear combination of x, yo,..., Yn. It follows easily that 
span {X15 Yoo. -, Ymt > Span {Y1, Yo,.--sYm}t = &- If m = 1 this shows that 
span {x,} = X, and the linear independence of the x,’s then implies n = 1. 
Otherwise xz = bx, + Dj. b,y,;. The independence of x, and x, implies 
some 5, # 0. Renumbering, we assume 5, # 0. Then 


y2 = bat xs — bx; — > by). 
j=3 
This implies that 
span {X), X2, Ya, ---» Ym} > Span {Xi, Yo,---» Ym} = X. 


Continuing in this way, we see that after the y,’s are suitably renumbered, 
each set {X,, Xo,..., Xx, Veris+ +> Ym} Spans X, k < m. In particular, taking 
k = m we have that {x,, Xo,...,X,} spans X. Since the x,’s were assumed 
linearly independent, we must haven < m. Thusn=m. [J 


If X has a basis with m elements, n = 0, 1,2,..., then any basis has n 
elements. The number n is called the dimension of n. We write n = dim X. 
The argument used to prove Theorem 7.3 proves somewhat more. 


Theorem 7.4. Suppose X is a finite-dimensional vector space with dimen- 
sion n. Any subset X which spans X has at least n elements. Any subset of X 
which is linearly independent has at most n elements. An ordered subset of n 
elements which either spans X or is linearly independent is a basis. 


32 Basic concepts 


Suppose (xj, X2,..., X,) is a basis of X. Then any x € X can be written as 
a linear combination x = > a,;x,;. The scalars a,, dz,..., a, are unique; in 
fact ifx = > b,x,, then 


O=x-—x= > (a, — b,x, 


Since the x,’s are linearly independent, each a, — b, = 0, i.€., a; = 6,. Thus 
the equation x = > a,x, associates to each x € X a unique ordered n-tuple 
(a, Ag, ..., 4,) Of scalars, called the coordinates of x with respect to the basis 
(x,,...,X,). Note that if x and y correspond respectively to (@;, da, ...5 An) 
and (b;, bz,...,5,), then ax corresponds to (aa;, dd2,...,aa,) and x + y 
corresponds to (a, + b;, dg + bo,...,a, + 5,). In other words, the basis 
(x1, Xo,..., Xn) gives rise to a function from X onto R” or C* which preserves 
the vector operations. 

Suppose X and Y are vector spaces, either both real or both complex. 
A function T: X -> Y is said to be Jinear if for all vectors x, x’ e X and all 
scalars a, 

T(ax) = aT(x), T(x + y) = T(x) + T(y). 

A linear function is often called a linear operator or a linear transformation. 
A linear function T: X — R (for X a real vector space) or T: X > C (for X 
a complex vector space) is called a linear functional. 


Examples 

1. Suppose X is a real vector space and (Xj, X2,...,X,) a basis. Let 
T(> ajx;) = (ai, dg, ..-, An). Then T: X — R’ is a linear transformation. 

2. Let T(z) = z*, ze C. Then Tis a linear transformation of C into itself 
if C is considered as a real vector space, but is not linear if C is considered as 
a complex vector space. 

3. Let fj: R" > R be defined by (x1, X2,..., Xn) = X;. Then f; is a linear 
functional. 

4. Let X be the space of polynomials with real coefficients. The two func- 
tions S, 7 defined below are linear transformations from X to itself. If 


f(x) = deo a,x’, then 


SA) = > G+ Daal 


T(f\(x) = S ja;x?-*, 


j=1 


Note that T(S(/)) =f, while S(7({)) = f if and only if ay = 0. 


Exercises 


1. If the linearly independent finite set {x,, X2,..., Xn} does not span X, 
then there is a vector x,,,¢X such that {x,, Xo,..., Xn, Xn42} 1S linearly 
independent. 
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2. If X is finite-dimensional and x,, X2,..., X, are linearly independent, 
then there is a basis of X containing the vectors x,, X>,..., Xn. 

3. If X is a finite-dimensional vector space and Y < X is a subspace, then 
Y is finite-dimensional. Moreover, dim Y < dim X, and dim Y < dim X 
unless Y = X. 

4. If Y and Z are subspaces of the finite-dimensional vector space X and 
YO Z = {0}, then dim Y + dim Z < dim X. 

5. Suppose Y and Z are subspaces of the finite-dimensional vector space 
X, and suppose span (Y U Z) = X. Then dim Y + dim Z > dim X. 

6. If Y is a subspace of the finite dimensional vector space X, then there 
is a subspace Z with the properties YO Z = {0}; dim Y + dim Z = dim X; 
any vector x € X can be expressed uniquely in the form x = y + z, where 
y € Y, ze Z. (Such subspaces are said to be complementary.) 

7. Prove Theorem 7.4. 

8. The polynomials f(x) = x", 0 < m <n, are a basis for the vector 
space of polynomials of degree <n. 

9. The vector space of all polynomials is infinite dimensional. 

10. If S is a non-empty set, the vector space F(S; R) of functions from 
S to R is finite dimensional if and only if S is finite. 

11. If X and Y are vector spaces and T: X -> Y is a linear transformation, 
then the sets 


N(T) = {x | xe X, T(x) = 0} 
R(T) = {T(x) | xe X} 


are subspaces of X and Y, respectively. (They are called the null space or 
kernel of T, and the range of T, respectively.) Tis 1-1 ifand only if N(T) = {0}. 

12. If X is finite dimensional, the subspaces N(7) and R(T) in problem 11 
satisfy dim N(7) + dim R(T) = dim X. In particular, if dim Y = dim X, 
then T is 1-1 if and only if it is onto. (Hint: choose a basis for N(T) and use 
problem 2 to extend to a basis for X. Then the images under T of the basis 
elements not in N(T) are a basis for R(7).) 


Chapter 2 


Continuous Functions 


§1. Continuity, uniform continuity, and compactness 


Suppose (S, d) and (S’,d’) are metric spaces. A function f: S— S" is 
said to be continuous ut the point x < S if for each e > O there isa 6 > O such 
that 


d'(f(x),f(y)) <2 — if'd(x,y) < 6. 


In particular, if S and S’ are subsets of R or of C (with the usual metrics) then 
the condition is 


ify) -fo)|<e  ifly—x| <6 


(This definition is equivalent to the following one, given in terms of con- 
vergence of sequences: f is continuous at x if f(x,) > f(x) whenever (Xn)n=1 
is a sequence in S which converges to x. The equivalence is left as an 
exercise.) 

Recall that we can add, multiply, and take scalar multiples of functions 
with values in C (or R): iff, g: S> CandaeC, xeS, then 


Cf + g(x) = f(x) + g(x), 
(af (x) = af(x), 
(fg\(x) = f(x)g(x), 
(flex) = f(x)/g(x) — if g(x) # 9. 


Proposition 1.1. Suppose (S,d) is any metric space, and suppose 
f, g: S—>C are functions which are continuous at x. Then f + g, af, and fg are 
continuous at x. If g(x) 4 0, then f/g is defined in a ball B,(x) and is continuous 
at x. 


Proof. Continuity of f + g and af at x follow from the definition of 
continuity and the inequalities 


If + gv) — + g)(x)| = [f(v) — f) + 8(y) — 8@)| 
< [f(y) — f()| + |e) — ge), 
laf )(v) — (af)(x)| = lal |f(y) — FO). 


To show continuity of fg at x we choose 8, > 0 so small that if d(y, x) < 96, 
then | f(y) — f(x)| < 1. Let M be the larger of | g(x)| and |/(x)| + 1. Given 
e > 0, choose 6 > Oso small that 


f(y) -— £0) < 2M, — |g(y) — 8(®)| < 4/2M 
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if d(y, x) < 5. Then d(y, x) < 6 implies 


I2(y) — a) = [Fe(y) — F)8)| 

fey) — fg) + fe) — f)g()| 
If()| |e(v) — g@)| + [F() — F%)| | e@)| 
[f(y)|-2/2M + M-e/2M 

|f(y)|-2/2M + e/2. 


ll IAW I 


But also 


If) = [f0) — F) + £&)] < [F() — FO) + IFO! 
< 1+ |f@)| < M, 


so |(/g)(y) — (48) < «. 
Finally, suppose g(x) # 0. Choose r > 0 so that | g(y) — g(x)| < 4|g(x)| 
if d(y, x) < r. Then if d(y, x) < r we have 


|e(x)| = |g(v) + g@) — a(y)| 
< |g(y)| + 4a), 


so |g(y)| = 4|g(x)| > 0. Thus 1/g is defined on B,(x). Since the product of 
functions continuous at x is continuous at x, we only need show that 1/g is 
continuous at x. But if y € B,(x), then 


[I/g(y) — I/gx)| = |g(v) — g@)|/|e()| [g@| 
< K|g(y) — g@)|. 


where K = 2/| g(x)|?. Since g is continuous at x, it follows that I/gis. 


A function f: S — S’ is said to be continuous if it is continuous at each 
point of S. 
The following is an immediate consequence of Proposition 1.1. 


Corollary 1.2. Suppose f, g: S—> C are continuous. Then f + g, af, and 
fg are also continuous. If g(x) # 0, all x, then f/g is continuous. 


A function f: S — S’ is said to be uniformly continuous if for each e > 0 
there is a 6 > O such that 


d'(f(x),f(y)) <2 if d(x, y) < 6. 


In particular, if S, S’ < C, then this condition reads 


fy) -f@|<e  if|y—x| < 


The distinction betwe encontinuity and uniform continuity is important. 
If fis continuous, then for each x and each « > 0 there is a 6 > O such that 
the above condition holds; however, 56 may depend on x. As an example, let 
S = S’ = RB,f(x) = x2. Then| f(y) — f(@)| = |y? — x7] = |y + xl ly — 2. 
If |x| is very large, then | y — x| must be very small for | f(y) — f(x)| to be 
less than 1. Thus this function is not uniformly continuous. (However it is 
clear that any uniformly continuous function is continuous.) 

In one important case, continuity does imply uniform continuity. 
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Theorem 1.3. Suppose (S, d) and (S’, d’) are metric spaces and suppose 
ft: S— S’ is continuous. If S is compact, then f is uniformly continuous. 


Proof. Given « > 0, we know that for each xeS there is a number 
d(x) > O such that 


d'(f(x), f(y) < te — if d(x, y) < 28(x). 
Let N(x) = Ba .(x). By the definition of compactness, there are points 


X1, Xo,...,X,E€S such that S< LU) N(x,). Let 6 = min {8(x,), d(x), ..., 
5(x,)}, and suppose d(x, y) < 6. There is some x, such that x € N(x). Then 


A(x;, x) < 8(xj) < 28(%), 

d(xi, y) < d(xi, x) + d(x, y) < A(x) + 5 < 28%), 
SO 

d'(f(x), f(y) < a’), FO) + aS), FY) 
<tette=e. U 
There are other pleasant properties of continuous functions on compact 

sets. A function f: S — C is said to be bounded if f(S) is a bounded set in C, 
1.e., there is an M > 0 such that 


If()| <M, allxeS. 


Theorem 1.4. Suppose (S, d) is a metric space and suppose f: S > C is 
continuous. Then f is bounded and there is a point X) € S such that 


|f(%o)| = sup {| f(x) |x € S}. 
If f(S) < R, then there are x, x € S such that 


f(x.) = sup (f(x) | xe S}, 
f(x) = inf (f(x) | x € S}. 
Proof. For each x € S, there is a number (x) > O such that | f(y) — f(x)| 
<1 if ye Bux(x) = N(x). Choose x,,...,X, such that S< LU M(x,). If 
x eS then x € N(x;) for some i, and 


[f(x)| < [fxd] + (£0) -— fxd] < [fxd] + 1. 


Thus we can take M = 1 + max {| f(x,)|,..., | f(x,)|} and we have shown 
that fis bounded. 

Let a = sup {| f(x)| | x € S}, and suppose | f(x)| < a, all x eS. Then for 
each x € S there are numbers a(x) < a and e(x) > 0 such that | f(y)| < a(x) 
if y € By(x) = M(x). Choose y1,..., ym such that S < ) M()y,), and let 
a, = max {a(yi),..., Aym)} < a. If xe S then x € M(y,) for some i, so 


| f(x)| < aly) < a, < a. 


This contradicts the assumption that a = sup {| f(x)| | x eS}. Thus there 
must be a point x with | f(x)| = a. 
The proof of the existence of x, and x_ when / is real-valued is similar, 


and we omit it. [ 
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Both theorems above apply in particular to continuous functions defined 
on aclosed bounded interval [a, b] < R. We need one further fact about such 
functions when real-valued: they skip no values. 


Theorem 1.5. (Intermediate Value Theorem). Suppose f: [a,b]—>R is 
continuous. Suppose either 


f(a)<c<f(b) or f(b)<c < f(a). 
Then there is a point Xo € [a, b| such that f(xo) = c. 


Proof. We consider only the case f(a) < c < f(b). Consider the two 
intervals [a, 4(a + b)], [4(a + 5), b]. For at least one of these, c lies between 
the values of f at the endpoints; denote this subinterval by [a,, b,]. Thus 
f(a) < c < f(b,). Continuing in this way we get a sequence of intervals 
[ans b,] with [Qn+1 bn+1) cS [ans b,], Bn+1 — Qn+1 = 30, ~~ an), and f(a) S 
c < f(b,). Then there is xp € [a, b] such that a, — Xo, b, — Xo. Thus 


F(X) = lim f(a,) < ¢. 
f(%o) = lim f(6,) = c. O 


Exercises 


1. Prove the equivalence of the two definitions of continuity at a point. 
2. Use Theorem 1.6 to give another proof of the existence of /2. Prove 
that any positive real number has a positive nth root, n = 1, 2,.... 
3. Suppose f: S—> S’, where (S, d) and (S’, d’) are metric spaces. Prove 
that the following are equivalent: 
(a) fis continuous; 
(b) for each open set A’ < S’, f~+(A’) is open; 
(c) for each closed set A’ < S$’, f~1(A’) is closed. 
4. Find continuous functions f;: (0, 1) > R, 7 = 1, 2, 3, such that 
jf, is not bounded, 
fz is bounded but not uniformly continuous, 
Ff; is bounded but there are no points x,, x_ € (0, 1) such that fa(x,) = 
sup (fax) | x €(0, 1D}, falx-) = inf {fa(x) | x € , 1}. 
5. Suppose f: S —> S’ is continuous and S is compact. Prove that /(S) is 
compact. 
6. Use Exercise 5 and Theorem 6.2 of Chapter | to give another proof of 
Theorem 1.4. 
7. Use Exercise 3 of Chapter 1, §6 to give a third proof of Theorem 1.4. 
(Hint: take (x,)?_, © S such that lim | f(x,)| = sup {f(x) | x € S}, etc.) 
8. Suppose (S, d) is a metric space, x € S, and r > 0. Show that there is 
a continuous function f: S > R with the properties: 0 < f(y) < l, all yeS, 
S(y) = Oif y € B(x), f(x) = 1. (Hint: take f(y) = max {1 — r-*d(y, x), 0}. 
9. Suppose (S, d) is a metric space and ‘suppose S is not compact. Show 
that there is a continuous f: S —> R which is not bounded. (Hint: use Exercise 
8.) 
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§2. Integration of complex-valued functions 
A partition of a closed interval [a, b] < R is a finite ordered set of points 
P = (Xo, X1,.--, X,) With 
A=X< xX <-°:+ <x, =5. 
The mesh of the partition P is the maximum length of the subintervals 


[Xi-1, Xi]: 
|P| = max {x, — x;-, |i = 1,2,..., n}. 


If f: [a, b] + C is a bounded function and P = (Xo, X1,..., X,) iS a parti- 
tion of [a, b], then the Riemann sum of f associated with the partition P is the 
number 


S(f: P) = > Soea, — x11). 


The function fis said to be integrable (in the sense of Riemann) if there is 
number ze C such that 
lim S(f;P) = z. 
|P|+0 
More precisely, we mean that for any « > O there isa 6 > O such that 
(2.1) ISG; P)-—zl)<e if |P| <6. 


If this is the case, the number z is called the integral of f on [a, b] and denoted 
by 


b b 
| f or | F(x) dx. 
If f: [a, b] -> C is bounded, suppose | f(x)| < M, all x é [a, b]. Then for 
any partition P of [a, b], 
ISF;P)| < > [fOdIGi — 1-1) < MD (i - m1) = Mb - 9). 
Therefore, if fis integrable, 


b 
| s 
Recall that f: [a,b] > C is a sum f = g + ih where g and + are real- 


valued functions. The functions g and h are called the real and imaginary 
parts of f and are defined by 


g(x) = Re(f(x)), = A(x) = Im(f(Q)), x € [a, 5). 
We denote g by Re fand h by Im f. 


Proposition 2.1. A bounded function f: [a, b] —> C is integrable if and only 
if the real and imaginary parts of f are integrable. If so, 


[r=f Ref +if ims 


(2.2) < M(b — a), M = sup {| f(x)| | x € [a, bd}. 
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Proof. Recall that if z = x + iy, x,y €R, then 
(2.3) 4|x| + $1y] < [2] < [x] + ly. 
If P is any partition of [a, b], then 

S(f;P) = S(Ref; P) + iS(mf; P), 


and S(Ref; P), S(Im/;P) are real. Let z = x + iy, x, y real, and apply 
(2.3) to S(f; P) — z. We get 


4|S(Ref; P) — x| + 4|S(m/f; P) — | 
< |S(f; P) — z| < |S(Ref; P) — x| + |SUm/f; P) — yl. 


Thus S(f; P) > zas | P| > Oifand only if S(Re f; P) > x and S(Im f; P) > 
yas |P|>0. 

Proposition 2.2. Suppose f:[a,b]—>C and g: [a,b]—>C are bounded 
integrable functions, and suppose c € C. Then f + g and cf are integrable, and 


[ U+8)- [s+ [ « [ of =e [4 


Proof. For any partition P of [a, 5], 
S(f+e;P)=S(fsP)+ S(g;P), Sf P) = cS; P). 
The conclusions follow easily from these identities and the definition. [] 


Neither of these propositions identifies any integrable functions. We shall 
see shortly that continuous functions are integrable. The following criterion 
is useful in that connection. 


Proposition 2.3. A bounded function f: [a, b] —> C is integrable if and only 
if for each e > 0 there is a & > 0 such that 


(2.4) ISP) — SF Q)| <e if |PI,|Q] < 6. 


Proof. Suppose f is integrable, and let z = [. f. For any ¢ > O there is a 
5 > O such that S(/; P) is in the disc of radius te around z if |P| < 6. Then 
|P| < 6,|Q| < Simplies S(/; P), S(f; Q) are at distance <e. 

Conversely, suppose for each e > 0 there is a 6 > O such that (2.4) holds. 
Take partitions P, with |P,| < 1/n,n = 1,2,3,..., and let z, = S(f; P,). 
It follows from our assumption that (z,)?_, is a Cauchy sequence. Let z be 
its limit. If n is large, |P,| is small and S(f; P,) is close to z, and if |Q| is 
small, S(f; Q) is close to S(f; P,). Thus S(f; Q) > zas |Q|—90. 


Theorem 2.4. Jf f: [a, b] > C is continuous, it is integrable. 


Proof. We know by §1 that f is bounded and uniformly continuous. 
Given e > 0, choose § > 0so that | f(x) — f(y)| < eif |x — y| < 8. Suppose 
P, OQ are partitions of [a, b] with |P| < 6, |Q| < 5. Suppose P = (Xo, %1,..., 
X,). Let P’ be a partition which includes all points of P and of Q, P’ = 
(Vo; Viv» + +s Ym). We examine one summand of S(f; P). Suppose 


Xi-1 = Vp-a < Vi <0 SV = XM 
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Then 


I(x) — Xi-1) — z/Orm — yi-1) 


(f(x) — FW) — Yi-1) 


Se — \i-1) = &(% — X-1), 


So 
since each |x; — y,| < 5. Adding, we get 

ISG; P) — SUP) < eb — a). 
Similarly, 

ISG Q) — SU P’)| < 6 — a), 


SO 


ISGP) — SCF; Q)| < 2b — a). 
By Proposition 2.3, fis integrable. [ 
We now want to consider the effect of integrating over subintervals. 


Proposition 2.5. Suppose a < b < c and f: [a,c]->C is bounded. Then 
ft is integrable if and only if it is integrable as a function on [a, b] and on [b, c]. 


If so, then 
[r-[r+]r 


Proof. Suppose f is integrable on [a,b] and on [b,c]. Given « > 0, 
choose 6 > 0 so that 


sip) -[-f|<de 


if P, Q are partitions of [a, b], [b, c] respectively, |P| < 6, |Q| < 5. Suppose 
P' is a partition of [a, c], |P’| < 6. If b is a point of P’, then P’ determines 
partitions P of [a, b] and Q of [b, c], |P| < 8, |Q| < 56. It follows from (2.5) 
that 


(2.6) susP) _ ) fp [ f < 8, 


If 5 is not a point of P’, let P” be the partition obtained by adjoining b. Then 
(2.6) holds with P” in place of P’. Suppose | f(x)| < M, all x € [a, b]. The 
sums S(f;P’) and S(/; P”) differ only in terms corresponding to the sub- 
interval determined by P’ which contains b. It is easy to check, then, that 


IS(f; P’) — S(f; P| < 28M. 


(2.5) 


S(f; Q) -{s| < he 


Thus 


sur) ~ [r-fr <e+28M 
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if |P’| < 6. It follows that fis integrable with integral I ft in ff. 

Conversely, suppose f is not integrable on [a, b] or [b, c]; say it is not 
integrable on [a, b]. Then there is an e« > O such that for any 5 > 0 there are 
partitions P,, P. of [a, b] with |P,| < 6, |P.| < 8, but 


SC; Pi) — S(f; P2)| > . 


Let Q be a partition of [b, c] with |Q| < 4, and let P;, P3 be the partitions of 
[a, c] containing all points of Q and of P,, P, respectively. Then | P;| < 4, 
| P,| < 6, and 


[SCF Pi) — SG Pa)| = |S Pi) — SG P2)| > e. 
By Proposition 1.3, fis not integrable. [ 
Suppose /: [do, bo] —> C is integrable, and suppose a, b € [do, bo]. Ifa < b, 


then fis integrable on [a, b]. (In fact fis integrable on [do, 5], therefore on 
[a, b], by two applications of Proposition 2.5.) If b < a, then fis integrable 


on [b, a] and we define 
| f--[ 2 


| f- 0. 


Then one can easily check, case by case, that for any a, b, c € [ao, bo], 


(2.7) fr-[ refs 


It is convenient to extend the notion of the integral to certain unbounded 
functions and to certain functions on unbounded intervals; such integrals are 
called improper integrals. We give two examples, and leave the remaining 
cases to the reader. 

Suppose f: (ad), b] > C is bounded and integrable on each subinterval 
[a, b], dog < a < b. We set 


We also define 


(2.8) { falim (f 


aan’ a 


if the limit exists. 
Suppose /: [a, 00) > C is bounded and integrable on each subinterval 
[a, bl], a < b < ow. We set 


(2.9) [ f = lim f 


bo 


if the limit exists; this means that there is a z € C such that for each e > 0 


if b = D(e). 


—- Z\<e 
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Exercises 


1. Let f: [0,1] R with f(x) = 0 if x is irrational, f(x) = 1 if x is 
rational. Show that fis not integrable. 

2. Let f: [0, 1] > R be defined by: f(0) = 0, f(x) = sin (1/x) if x 4 0. 
Sketch the graph. Show that fis integrable. 

3. Suppose f, g: [a, b] + C are bounded, f is integrable, and g(x) = f(x) 
except on a finite set of points in [a, b]. Then g is integrable and I g= I f. 

4. Suppose /: [a, b] > C is bounded and is continuous except at some 
finite set of points in [a, b]. Show that fis integrable. 

5. Suppose /: [a, b] — C is continuous and f(x) = 0, all x € [a, b]. Show 
that [” f = 0 implies f(x) = 0, all x € [a, 6}. 

6. Suppose /: [a, b] > C is bounded, integrable, and real-valued. Suppose 


[s| = M(b— a), where M = sup {|f(x)| | xé [a, d]}. 


Show that fis constant. 

7. Do Exercise 6 without the assumption that fis real-valued. 

8. Let f: [0, 1] ~ C be defined by: f(x) = 0 if x = 0 or x is irrational, 
f(x) = 1/q if x = p/q, p,q relatively prime positive integers. Show that / is 
continuous at x if and only if x 1s zero or irrational. Show that fis integrable 
and {> f = 0. 


§3. Differentiation of complex-valued functions 


Suppose (a, b) is an open interval in R and that /: (a, b) > C. As in the 
case of a real-valued function, we say that the function fis differentiable at the 
point x € (a, b) if the limit 


yox Vox 


exists. More precisely, this means that there is a number z eé C such that for 
any e > 0, there isa 6 > 0 with 


(3.2) IF) — FO) — xt - 2) <2 if0<|y—x| <6. 


If so, the (unique) number z is called the derivative of f at x and denoted 
variously by 


f'@, Df, o Lo. 


Proposition 3.1. Jf f: (a,b)—C is differentiable at x €(a, b) then f is 
continuous at x. 
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Proof. Choose 6 > 0 so that (3.2) holds with z = f’(x) and e = 1. Then 
when |y — x| < 6 we have 


f(y) — FOE) s [F0) — F@) -— 2y — *) + [2 - »)| 
< (1 + lz)iy — xl. 


As y> x, f(y) > f(x). 0 


Proposition 3.2. The function f: (a, b) — C is differentiable at x € (a, b) if 
and only if the real and imaginary parts g = Re fandh = Im fare differentiable 
at x. If so, then 


F(x) = g(x) + thx). 


Proof. As in the proof of Proposition 2.1, the limit (3.1) exists if and 
only if the limits of the real and imaginary parts of this expression exist. If so, 
these are respectively g(x) and A(x). 


Proposition 3.3. Suppose f: (a, b)—> C and g: (a, b)-—>C are differenti- 
able at x € (a, b), and suppose c € C. Then the functions f + g, cf, and fg are 
differentiable at x, and 


(f + g)'(x) =f) + 8°), 
(cf)'(x) = of (x), 
(fg)'(x) = f(x) a(x) + fx)a"). 


If g(x) 4 0 then f/g is differentiable at x and 
(f/g)'(x) = Lf’) g(x) — f)g'(x) a(x) ~?. 


Proof. This can be proved by reducing it to the (presumed known) 
theorem for real-valued functions, using Proposition 3.2. An alternative is 
simply to repeat the proofs, which are no different in the complex case. We 
shall do this for the product, as an example. We have 


(fe)\(y) = (fe)(x) = f(y) e(y) — F(x) a(x) 
= [f(y) — f)]g(y) + fOOLe(y) — g(x). 


Divide by (y — x) and let y ~ x. Since g(y) — g(x), the first term converges 
to f’(x)g(x). The second converges to f(x)g’(x). LD 


We recall the following theorem, which is only valid for real-valued 
functions. 


Theorem 3.4. (Mean Value Theorem). Suppose f: [a,b] —> R is contin- 
uous, and is differentiable at each point of (a, b). Then there is ac € (a, b) such 
that 


fc) = [F) — F@b — a)~*. 


Proof. Suppose first that f(b) = f(a). By Theorem 1.5 there are points 
c, and c_ in [a, b] such that f(c.) = f(x), all x € [a, b] and f(c_) < f(x), all 
x € [a, b]. If c, and c_ are both either a or b, then fis constant and f’(c) = 0, 


44 Continuous functions 


all c € (a, b). Otherwise, suppose c, € (a, b). It follows that [/(y) — f(c.)] x 
(y — c,) lis <Oify < c, and >Oify > c,. Therefore the limit as y > cx, 
is zero. Similarly, if c. # a and c_ # b, then f’(c_) = 0. Thus in this case 
f'(c) = 0 for some c € (a, Bb). 
In the general case, let 
g(x) = f(x) — & — af) — f@I16 - a)~*. 
Then g(a) = f(a) = g(b). By what we have just proved, there is a c € (a, b) 
such that 
0 = gc) =f") — LO) — f@I6 —- a). 0 


Corollary 3.5. Suppose f: [a, b] — C is continuous, and is differentiable at 
each point of (a, b). If f'(x) = 0 for each x € (a, b), then f is constant. 

Proof. Let g, h be the real and imaginary parts of f. Then g(x) = h'(x) = 
0, x €(a, b). We want to show g, / constant. If [x, y] < [a, 5], Theorem 3.1 
applied to g, h on [x, y] implies g(x) = g(y), A(x) = A(y). O 


Theorem 3.6. (Fundamental Theorem of Calculus). Suppose f: [a, b] > 
C is continuous and suppose c é [a, b]. The function F: [a, b] > C defined by 


Fx) = [ “f 


is differentiable at each point x of (a, b) and 
F(x) = f(x). 


Proof. Let g be the constant function g(y) = f(x), yeE(a, db). Given 
e > 0, choose § so small that | f(y) — g(y)| = |f(v) — f@)| < eif|y — x| < 
6. Then - 


(3.3) FO) - Fe) = [ fo ) “f= ) “f 


-|'s +] G8) 


= fy») +] 0-2). 


x 


If |y — x| < 4, then 


< |y — x| sup {| f(r) — f()| | |r — x] < |» — x}} 


< ely — x|. 


[ v-® 


Thus dividing (3.3) by (y — x) we get 
[[F(v) — FQ) My — x)-* — fQ@)| < «. 0 


Theorem 3.7. Suppose f: [a,b] > R is continuous and differentiable at each 
point of (a, b) and suppose f(x) > 0, all x € (a, b). Then f is strictly increasing 
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on (a,b). For each yeé[f(a) f(b)] there is a unique point x = g(y)€é [a, 5] 
such that f(x) = y. The function g = f~' is differentiable at each point of 
(f(a), f(6)) and 

g(y) = (sy). 

Proof. If x, y¢[a,b] and x < y, application of Theorem 3.4 to [x, y] 
shows that f(x) < f(y). In particular, f(a) < f(b). By Theorem 1.6, if 
f(a) < y < f(b) there is x € [a, b] with f(x) = y. Since fis strictly increasing, 
x is unique. Letting g = f~+ we note that g is continuous. In fact, suppose 
yeE(f(a), f(b)) and « > 0. Take y’, y” such that 


fa)sy<y<y" < fl) 
and y°’—y<e, y—y' <e. Let x’ = g(y’), x = g(y), x” = g(y’). Then 
x’ <x < x", Let 6 = min {x” — x,x — x’}. If|x — w| < dthenwe (x, x’), 
so f(w) €(y’, y”), so | f(w) — f(x)| = [f(w) — y| < e¢. Continuity at f(a) and 
F(6) is proved similarly. 
Finally, let x = g(y), x’ = g(y’). Then 


g(y") — gy) x'-x 
3.3 paw AON = =e 
e) yoy Tey-fe 
As y’ —> y, we have shown that x’ > x. Thus (3.3) converges to f’(x)7* = 


(f'(eQ))]-*. 0 


Proposition 3.8. (Chain rule). Suppose g: (a, b)-—>R is differentiable at 
x, and suppose f: g((a, b)) > C is differentiable at g(x). Then the composite 
function f o g is differentiable at x and 


(3.4) (f° g)'(x) = f'(g(x))g'@). 
Proof. We have 


(3.5) feg(y) —fea(x) = f(s(y) — f(s) 


_ f(g(y)) — f(g) gly) — a(x) (, ~ x) 
g(y) — g(x) y-x 
if g(y) # g(x). If g(x) # 0 then g(y) ¥ g(x) if y is close to x and y ¥ x. 
Taking the limit as y > x we get (3.4). Suppose g’(x) = 0. For each y near x 
either g(y) = g(x), so fo g(y) — fo g(x) = 0, or (3.5) holds. In either case, 


[fo e(y) — fe 2e(x)(y — x)~* is close to zero for y near x. Qj 
Proposition 3.9. (Change of variables in integration). Suppose g: [a, b] > 
R is continuous, and is differentiable at each point of (a, b). Suppose f: g({a, b]) 


—> C is continuous. Then 
g(b) 


“ f=] Gos’ 


Proof. Define F: g([a, b]) ~ C and G: [a, b] > C by 


F=f Ge)=| Cede’ 
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We want to prove that F(g(b)) = G(b); we shall in fact show that Fog = G 
on [a, b]. Since Fo g(a) = G(a) = 0, it suffices to prove that the derivatives 
are the same. But 


(Fog) =(F'cg)g’ =(fog)y’ =G. 0 


A function /: (a, b) —> C is said to be differentiable if it is differentiable at 
each point of (a,b). If f is differentiable, then the derivative f’ is itself a 
function from (a, b) to C which may (or may not) have a derivative f"(x) = 
(f’)'(x) at x € (a, b). This is called the second derivative of f at x and denoted 
also by 


yx), (. 


Higher derivatives are defined similarly, by induction: 


FOX) =f&), FX) =F), 
f¥* Mx) = fF”) (x), kk =0,1,2,.... 


The function /: (a, b) —> C is said to be of class C", or k-times continuously 
differentiable, if each of the derivatives f, f’,..., f° is a continuous function 
on (a, b). The function is said to be a class C®, or infinitely differentiable, if 
f® is continuous on (a, b) for every integer k > 0. 


Exercises 


1. Show that any polynomial is infinitely differentiable. 

2. Show that the Mean Value Theorem is not true for complex-valued 
functions, in general, by finding a differentiable function f such that f(0) = 
0 = f(1) but f(x) 4 Ofor0 < x < 1. 

3. State and prove a theorem analogous to Theorem 3.7 when f’(x) < 0, 
all x € (a, 5). 

4. Suppose f, g are of class C* and ce C. Show that f + g, cf, and fg are 
of class C*. 

5. Suppose p is a polynomial with real coefficients. Show that between 
any two distinct real roots of p there is a real root of p’. 

6. Show that for any k = 0,1, 2,... there is a function /: R — R which 
is of class C*, such that f(x) = 0 if x < 0, f(x) > 0 if x > 0. Is there a 
function of class C® having this property? 

7. Prove the following extension of the mean value theorem: if fand g are 
continuous real-valued functions on [a, b], and if the derivatives exist at each 
point of (a, b), then there is c € (a, 5) such that 


Lf) — f@ls’(©) = [s®) — s@lr'O. 
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8. Prove L’ Hopital’s rule: if fand g are as in Exercise 7 and if 
lim f'(@)Le'@)1> 
exists and f(a) = g(a) = 0, then 
tim f)g@))7? 


exists and the two limits are equal. 


$4, Sequences and series of functions 


Suppose that S is any set and that f: S > C is a bounded function. Let 
[f| = sup {|f(@)| | x € S}. 


A sequence of bounded functions (/,)r-1, from S to C is said to converge 
uniformly to f if 


lim |f, —/f| = 9, 


This sequence (/,;,)7°-1 1s said to be a uniform Cauchy sequence if for each e > 0 
there is an integer N so that 


(4.1) lf; —Sm| < e ifn,meNn. 


It is not difficult to show that if the sequence converges uniformly to a 
function f, then it is a uniform Cauchy sequence. The converse is also true. 


Theorem 4.1. Suppose (f,)r-1 is a sequence of bounded functions from a 
set S to C which is a uniform Cauchy sequence. Then there is a unique bounded 
function f: S — C such that (f,)r-1 converges uniformly to f. If S is a metric 
space and each f,, is continuous, then f is continuous. 


Proof. For each x € S, we have 


| Fux) — fin(x)| <s Tn — fini. 


Therefore (f,,(x))”_1 is a Cauchy sequence-in C. Denote the limit by f(x). We 
want to show that (/,)7-, converges uniformly to the function f defined in 
this way. Given « > 0, take N so large that (4.1) holds. Then for a fixed 
m> WN, 


Fa) — f)| = fal) — Tim f,()) 
= lim | fa(*) — fu] < e. 


Thus | fn —f| < ¢ if m = N, and (f,)n=1 converges uniformly to f. If the 
sequence also converged uniformly to g, then for each x € S, 


| fn(x) ~ g(x)| S [fn ~ g| 
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so f,(x) — g(x) asn— o, and g = f. The function fis bounded, since 
F%)| = |F) — fa) + fnl)| < & + [fnlx)| < & + [Snl, 


so |f| < e+ |fal, ifm = N. 
Finally, suppose each f, is continuous on the metric space S. Suppose 
xe Sande > 0. Choose N as above. Choose 6 > 0 so small that 


tiv) — fr) <e if d(y,x) < 8. 
Then 


f(y) — FO) < [F(Y) — fv) + [fv) — Fr) + [fv — FDI 
< |f — ful + [fr(y) — Av) + lf — Srl 
< 3e if d(y, x) < 64. 


Thus fis continuous. [J 


The usefulness of the notion of uniform convergence is indicated by the 
next theorem and the example following. 


Theorem 4.2. Suppose (f,)7=1 is a sequence of continuous complex-valued 
functions on the interval [a, b|, and suppose it converges uniformly to f. Then 


[t- lim te 


na? OO 


Proof. By (2.2), 


[ ft -{s| = If us -/)| < |f, —f|-|b - al. 


Asn—>o, this —>0. [ 


Example 


For each positive integer n, let f,: [0, 1] — R be the function whose graph 
consists of the line segments joining the pairs of points (0, 0), ((2n)~?, 2n); 
((2n)~*, 2n), (n=+, 0); (277, 0), (1, 0). Then f, is continuous, f,(x)-—>0 as 
n—> oo for each x € [0, 1], but f° f, = 1, alla. 


Here we are interested particularly in sequences of functions which are 
partial sums of power series. Associated with the sequence (a,)?°-» in C and 
the point Zp, € C is the series 


Ms 


(4.2) a,(Z — Zo)", zEeC, 


0 


n 


Recall from §3 of Chapter 1 that there is a number R, 0 < R < ©, such that 
(4.2) converges when |z — z.| < R and diverges when |z — zo| > R; R is 
called the radius of convergence of (3.2). The partial sums 


n 


(4.3) fAZ) = > An(Z — Zo)™ 


m=0 
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are continuous functions on C which converge at each point z with |z — zo| < 
R to the function 


(4.4) f@) = Dd, anlzZ — 20)", lz — 20] < BR. 
m=0 
Theorem 4.3. Let R be the radius of convergence of the power series (4.2): 
Then the function f defined by (4.4) is a continuous function. Moreover, the 
functions f,, defined by (4.3) converge to f uniformly on each disc 
D, = {z| |z-— Z| <r}, O<r<R. 


Proof. We prove uniform convergence first. Given 0 < r < R, choose s 
with r < s < R. Take w with |w — z,| = s. By assumption, > a,(w — Zo)” 
converges. Therefore the terms of this series + 0. It follows that there is a 
constant M so that 


la,(w — Zo)"| < M, n=0,1,.... 
Since |w — Z)| = s, this means 
(4.5) la,| < Ms~", n=0,1,..., 


Now suppose z€ D, and m < n. Then 


A) — fal = |S alz — zoy 


m+1 
< > lal |e - 20! 
< S Ms~ir? 
: mei gett yan 
-M > =m < TS 


where 6 = r/s < 1. As m-> oo the final expression on the right — 0, so 
(fr)=o is a uniform Cauchy sequence on D,. It follows that it converges to f 
uniformly on D, and that fis continuous on D,. Since this is true for each 
r < R, fis continuous. { 


In particular, suppose x, € R. The power series 
(4.6) >, In(x — Xo)" 
n=0 


defines a continuous function in the open interval (xo — R, Xo + R). Is this 
function differentiable ? 


Theorem 4.4. Suppose the power series (4.6) has radius of convergence R° 
Then the function f defined by this series is differentiable, and 


(4.7) f(x) = S na,(x — Xo)"~?, |x — Xo| < R. 
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Proof. To simplify notation we shall assume x) = 0. We claim first that 
the two series 


f¢) 12.9) 
(4.8) > na,x"—1, > n*la,|r"~? 
n=1 n=2 


converge uniformly for |x| < r < R. Take r < s < R. Then (4.5) holds. It 
follows that 


n n 
> |ma,x"-1| < M > ms~™rm—} 
m=1 m= 


= Mr > md", 
m=1 
$ =r/s < 1. Take e > 0 so small that (1 + «)5 < 1. By Exercise 4 of 
Chapter 1, §3, m < (1 + «)™ for all large m. Therefore there is a constant M’ 
so that 
m< M'(1 +)", m=1,2,.... 
Then 


n 


S |ma,x™~+| < rMM’' > (1 + «)™6". 
n=1 


m=1 


This last series converges, so the first series in (4.8) converges uniformly for 
|x| < r. Similarly, m? < (1 + «) for large m, and the second series in (4.8) 


converges uniformly for |x| < r. 
Let g be the function defined by the first series in (4.8). Recall that we are 
taking x, to be 0. We want to show that 


(4.9) Lfiy) — f(y — x*)7* — g(x) > 0 as yx. 


Assume |x| < r,|y| < r. Then the expression in (4.9) is 


(4.10) > aly" — 28 — mA — I-99 


Now 
y" — x" =(y — x)anlx, y) 


where 
gA(X,¥) = yet t xy 2 eee t xt 2y 4 mod, 
Thus 
\e(x,y)| sar"? if |x} <r, |yl <r. 
Then 


y" _ x” _ nx"—1(y _ x) 
= (y — xy" + xy" 2 +e) + xt) — x4] 
= (y — x)[(ytt — xt) + (yp? — xt) te Hy — xx? 
+ xr-1 _ x" 1] 
= (y — x)*[gn-i(x, y) + Sn-2lX, yx +++ + Bi(x, y)x"~7], 
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SO 
[y® — x” — nx®- My — x) < |y — x/? net? 
It follows that for |x| < r,|y|] < r we have 


ioe] 


ILA”) — FO)M(y — x)-? — g(x)| < > lanl [y — x|2n?r*-2] y — x]? 


n= 


ly —x| > nIa,|r"-? = Kly — xl, 
n=2 
K constant. Thus f’(x) = g(x). O 


Corollary 4.5. The function f in Theorem 4.5 is infinitely differentiable, 
and 


(4.11) FX) = > n(n — In — 2)--- (1 — K + Vag(x — x0)*-*, 
n=k 
|x — Xo| < R. 
Proof. This follows from Theorem 4.4 by induction onk. [ 


In particular, if we take x = x, in (4.11) then all terms of the series except 
the first are zero and (4.11) becomes 


(4.12) a, = (kN) t f(x). 


This means that the coefficients of the power series (4.6) are determined 
uniquely by the function f (provided the radius of convergence is positive). 


Exercises 


1. Find the function defined for |x| < 1 by f(x) = >@., x"/n. (Hint: 


f(x) = fFF) 
2. Show that if fis defined by (4.6), then 


[ f= > + 1a — xo 


3. Find the function defined for |x| < 1 by f(x) = >2_, nx™71, 

4. Suppose there is a sequence (x,)7-, such that |x,41 — Xo] < |x, — Xl, 
Xn —> Xp, and f(x,) = 0 for each n, where f is defined by (4.6). Show that 
f(x) = 0 for all x. (Hint: show that do, a,, d2,... are each zero.) 


§5. Differential equations and the exponential function 


Rather than define the general notion of a “differential equation”’ here, 
we shall consider some particular examples. We begin with the problem of 
finding a continuously differentiable function E: R — C such that 


(5.1) E(O) = 1, E'(x) = E(x), xeER. 
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Suppose there were such a function E, and suppose it could be defined by a 
power Series 


E(x) = > a,x". 


Then (5.1) and Theorem 4.4 imply 


ie 6) eo 
> na,x"~t = > a,x", 
n=1 


or 
c co 
> (n + l)a,41x" = > a,x". 
n=0 n=0 


Since the coefficients are uniquely determined, this implies 
Ane. = a,/(n + 1), n=0,1,2,.... 
But 
a) = E(0) = 1, 
so inductively 
a, = (n!)~1 = [n(v — 1) — 2)--- 1474. 
We have shown that if there is a solution of (5.1) defined by a power series, 


then it is given by 


(5.2) E(x) = S (n!)~ 4x". 


n=0 


The ratio test shows that (5.2) converges for all real or complex x, and 
application of Theorem 4.4 shows that E is indeed a solution of (5.1). We 
shall see that it is the only solution. 


Theorem 5.1. For each a,c €éC there is a unique continuously differenti- 
able function f: R — C such that 


(5.3) fO=c, f%)=af(x), xeR. 


This function is 
(5.4) f(x) = cE(ax) =c S (n!)~ta"x". 


Proof. The function given by (5.4) can be found by the argument used 
to find FE, and Theorem 4.4 shows that it is a solution of (5.3). To show 
uniqueness, suppose fis any solution of (5.3), and let g(x) = E(—ax), so that 


gO)=1, g(x) = —ag(x). 
Then fg is differentiable and 
(fg) =f'g + fg’ = afg — afg = 0. 
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Therefore fg is constant and this constant value is f(0)g(0) = c. If c # 0, this 
implies fg is never zero, So g is never zero. Then for any c, 


F(x) = e/g(x), alll x. 
Thus fis unique. {J 
This can be extended to more complicated problems. 


Theorem 5.2. For each a,c €C and each continuous function h: R + C 
there is a unique continuously differentiable function f: R — C such that 


(5.5) fOV=c, f(x) = af(x) + AC), xeER. 


Proof. Let fo(x) = E(ax), g(x) = E(—ax). As in the preceding proof, 
fog is constant, =1. Therefore neither function vanishes. Any solution f of 
(5.5) can be written as 


f=fifo, where fi = ef. 
Then 


f =fiho t+ fifo =fifo + of, 
so (5.5) holds if and only if 


AiO) = ¢, fa) fox) = A). 


These conditions are equivalent to 


A(x) =c + [ gh. 


Thus the unique solution of (5.5) is given by 


(5.6) fx) = cfolx) + fol) i ” gh 


cE(ax) + E (ax) { E(—at)h(t) dt. O 


Now we consider equations involving the second derivative as well. 


Theorem 5.3. For any b, c, do, d,€C€ and any continuous function 
h: R—>C there is a unique f: R — C of class C? which satisfies 


(5.7) f(0) = do, f'(0) = d,, 
(5.8) f'(x) + bf'(x) + f(x) = h(x), =x ER. 


Proof. To motivate the proof, we introduce two operations on functions 
of class C1 from R to C: given such a function g, let 


Dg=g', Ig =g. 
If g is of class C*, let D?g = g”. Then (5.8) can be written 
D*f + bDf + clf = h. 
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This suggests the polynomial z? + 2bz + c. We know there are roots aj, dy 
of this polynomial such that 


z*+ bz +c = (zZ — a,)(Z — ag), allzeC. 

Thus 

b = —a, — do, C = AyQ. 
From the properties of differentiation it follows that 

(D — a,D)[(D — al) f) = D*f — (a, + a2)Df + aoaif 
=f" + of + of 

Let 

g=f' —af=(D—a))f. 
We have shown that fis a solution of (5.8) if and only if 

(D — al)g = 8’ — ag =h. 


If also (5.7) holds, then g(0) = f’(0) — a.f(O) = dy — aed). Thus f is a 
solution of (5.7), (5.8) if and only if 


(5.9) {0O=h, f -af=g, 
where 
(5.10) 2(0) = d, — aed), g—age=h. 


But (5.10) has a unique solution g, and once g has been found then (5.9) has 
a unique solution. It follows that (5.7), (5.8) has a unique solution. {] 


Now we return to the function E, 
E(z) = > (n!)~ 22", zEC. 
Define the real number e by 
(5.11) e= E(1) = > (n!)~*. 


Theorem 5.4. The function E is a function from R to R of class C®. 
Moreover, 


(a) E(x) > 0,xeER, 

(b) for each y > 0, there is a unique x € R such that E(x) = y, 
(c) E(x + y) = E(x)E(y), all x, yER, 

(d) for any rational r, E(r) = e’. 


Proof. Since Eis defined by a power series, it is of class C®. It is clearly 
real for x real, and positive when x = 0. As above, E(x)E(—x) = 1, all x, 
so also E(x) > 0 when x < 0. 
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To prove (b), we wish to apply Theorem 1. Taking the first two terms in 
the series shows (since y > 0) that E(y) > 1 + y > y. Also, E(y~*) > y=}, 
SO 


E(—y"*) = E(y*)"* < (y*)"* = y. 


Thus there is x €(—y~’*, y) such that E(x) = y. Since E’ = E > 0, E is 
strictly increasing and x is unique. 

We have proved (c) when x = — y. Multiplying by E(—x)E(— y), we want 
to show 


E(x + y)E(—x)E(-y) = 1, allx, yeR. 


Fix y. This equation holds when x = 0, and differentiation with respect to x 
shows that the left side is constant. 
Finally, repeated use of (c) shows that 


E(nx) = E(x)", n=0, +1, +2,.... 
Thus 


e = E(1) = E(n/n) = E(1/n)", 
en — E(1/n), n= 1,2,3,.... 
emin — (el/n)m = E(1/n)™ = E(m/n). 0 


Because of (d) above and the continuity of E, it is customary to define 
arbitrary complex powers of e by 


(5.12) e = E(z) = S (n!)~ 12". 


The notation 
(5.13) e? = expz 


is also common. 
We extend part of Theorem 5.4 to the complex exponential function. 
(Recall that z* denotes the complex conjugate of z € C.) 


Theorem 5.5. For any complex numbers z and w, 
(5.14) E(z + w) = E(z)E(w), E(z*) = E(z)*. 


Proof. The second assertion can be proved by examining the partial 
sums of the series. To prove the first assertion, recall that we showed in the 
proof of Theorem 5.1 that E(zx)E(—zx) is constant, x¢R. Therefore 
E(z)E(—z) = E(0)? = 1. We want to show 


E(z + wyE(—z)E(—w) = 1, all z, weC. 
Let 
g(x) = E(z + w)x)E(—zx)E(— wx), xeER. 
Differentiation shows g is constant. But g(0) = 1. J 
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The notation (5.12) and the identity (5.14) can be used to consolidate 
expressions for the solutions of the differential equations above. The unique 
solution of 


{O=c, fi =afth 


1S 
(5.15) f(x) = ce** + | ° et*—Oh(t) dt. 


The unique solution of 


{(0O=h fO=a4 f+ gi +efr=h 


is given by 

(5.16) F(x) = doe*2* + [ eta —99(t) dt, 
where 

(5.17) g(x) = (dy — agdy)e"1* + [- e1%—Oh(t) dt, 


and d,, de are the roots of z2 + bz +c. 


Exercises 


1. Find the solution of 
fO=1, fM=0, f'-arf=0 
by the procedure in Theorem 5.3, and also by determining the coefficients in 


the power series expansion of f. 
2. Let f, g be the functions such that 


fO=1, fM=0, f"'+of+f=9, 
g0)=0, g(O)=1, ff" + bg +cg=0. 


Show that for any constants d,, d, the function h = d,f + d2g is a solution of 
(*) h" + bh'+ch=0. 


Show that conversely if / is a solution of this equation then there are unique 
constants d,, d,¢C such that h = d,f + d.g. (This shows that the set of 
solutions of (*) is a two-dimensional complex vector space, and (f, g) is a 
basis.) 

3. Suppose A(x) = >°-o d,x", the series converging for all x. Show that 
the solution of 


f0)=0=f0, ff +o +of=h 
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is of the form >.» a,x", where this series converges for all x. (Hint: deter- 
mine the coefficients ao, a,,... inductively, and prove convergence.) 

4. Suppose z? + bz2 + cz + d= (z — a,)\(Z — a2)(z — az), all zeC. 
Discuss the problem of finding a function f such that 


{09N=e, fM=a, f/M=e, (+d +c'+d=0. 


§6. Trigonometric functions and the logarithm 


In §5 the exponential function arose naturally from study of the differen- 
tial equation f’ = f. In this section we discuss solutions of one of the simplest 
equations involving the second derivative: f" + f = 0. 


Theorem 6.1. There are unique functions S,C: R->C of class C? such 
that 


(6.1) s0)=0, S(0)=1, S”’+S=0, 
(6.2) C(0)=1, C(0)=0, C"+C=0. 


Proof. Existence and uniqueness of such functions is a consequence of 
Theorem 5.3. [ 


Let us obtain expressions for S and C using the method of Theorem 5.3. 
The roots of z2 + 1 are z = +i. Therefore 


S(x) = { e~ *—Do(t) dt 


where 
g(x) = e™. 
Thus 
S(x) =| e~ Kx—-Hoit at = en | e2it at 
0 0 
_— e~ **(27)~ te2# 5 _ (27) ~1e- *(e?* _ 1), 
(6.3) S(x) = 7 (et — 7), 


A similar calculation gives 
(6.4) C(x) = 4(e* + e7*). 


Theorem 6.2. The functions S, C defined by (6.3) and (6.4) are real- 
valued functions of class C® on R. Moreover, 


(a) S’=C,C’ = -S, 
(b) S(x)? + C(x)? = I, allx eR, 
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(c) there is a smallest positive number p such that C(p) = 0, 
(d) if p is the number in (c), then 


S(x + 4p) = S(x), C(x + 4p) = C(x), allx ER. 
Proof. Since the exponential function exp (ax) is of class C® as a func- 


tion of x for each ae C, S and C are of class C®. Since (exp (ix))* = 
exp (—ix), S and C are real-valued. In fact, 


C(x) = Re (e’*), S(x) = Im (e*), 
SO 
e'* = C(x) + iS(x). 


Differentiation of (6.3) and (6.4) shows S’ = C, C’ = —S. Differentiation 
of S?2 + C2? shows that S(x)? + C(x)? is constant; the value for x = Ois 1. 

To prove (c), we suppose that C(x) 4 O for all x > 0. Since C(1) = 1 and 
C is continuous, the Intermediate Value Theorem implies C(x) > 0, all 
x > 0. Since S’ = C, S is then strictly increasing for x > 0. In particular, 
S(x) = S(1) > 0, all x = 1. But then 


0 < C(x) = C(1) + | " C(t) dt = C(I) — | ” S(t) dt 
< C(1) — | " s(t) dt = C(I) —(«— SQ), x= LL 


But for large x the last expression is negative, a contradiction. Thus C(x) = 0 
for some x > 0. Let p = inf {x | x > 0, C(x) = 0}. Then p > 0. There is a 
sequence (x,)* such that C(x,) = 0, p < xX, < p + I/n. Thus C(p) = 0, and 
p is the smallest positive number at which C vanishes. 

To prove (d) we note that 


1 = S(p)? + C(p)? = SY’, 


so S(p) = +1. But S(O) = O and S’ = Cis positive on [0, p), so S(p) > 0, 
Thus S(p) = 1. Consider S(x + p) asa function of x. It satisfies (6.2), and so 
by uniqueness we must have 


Six + p)=C(x), xeER. 
Similarly, — C(x + p) considered as a function of x satisfies (6.1), so 
C(x + p) = —S(x), xeER. 
Then 


S(x + 4p) = C(x + 3p) = —S(x + 2p) = —C(x + p) = SQ), 
C(x + 4p) = —S(x + 3p) = —C(x + 2p) = S(&x + p) = C2). O 


We define the positive number 7 by 
a = 2p, 


Trigonometric functions and the logarithm 59 


p the number in (c), (d) of Theorem 6.3. We define the functions sine and 
cosine for all ze C by 


(6.5) sin Zz = 5 ( iz _ gi?) = > [Qn + 1)!]-4(—1)%z2"*1, 
(6.6) cosz=-= 5 (e +e”) = > [2n!}]-4(—1)"z?". 


Note that because of the way we have defined z and the sine and cosine 
functions, it is necessary to prove that they have the usual geometric sig- 
nificance. 


Theorem 6.3. Let y: [0, 27] — R? be defined by 
y(t) = (cos ¢, sin fr). 


Then y is a 1-1 mapping of [0, 27] onto the unit circle about the origin in R?. 
The length of the arc of this circle from y(O) to y(t) is t. In particular, the 
length of the unit circle is 27. 


Proof. We know from Theorem 6.2 that (cos t)? + (sin t)? = 1, so 
(cos ¢, sin t) lies on the unit circle. The discussion in the proof of Theorem 6.2 
shows that on the interval [0, 477], cos t decreases strictly from 1 to 0 while 
sin ¢ increases strictly from 0 to 1. Therefore, y maps [0, 47] into the portion 
of the circle lying in the quadrant x > 0; y = 0 in a 1-1 manner. Further- 
more, suppose0 < x < 1,0 < y < l,and x? + y* = 1. By the Intermediate 
Value Theorem and the continuity of cosine, there is a unique ¢ € [0, 47] such 
that cost = x. Then sint > 0, (sint)? = 1 — x* = y*, and y>=0, so 
sin t = y. Thus y maps [0, 47] onto the portion of the circle in question. 

Since cos (t + 47) = —sint and sin(¢ + 47) = cost, the cosine de- 
creases from 0 to —1 and the sine decreases from 1 to 0 on [47, z]. As above, 
we find that y maps this interval 1-1 and onto the portion of the circle in the 
quadrant x < 0, y = 0. Continuing in this way we see that y does indeed 
map [0, 27) 1-1 onto the unit circle. 

The length of the curve y from (0) to y(t) is usually defined to be the 
limit, if it exists, of the lengths of polygonal approximations. Specifically, 
suppose 


=tp<tr<te<-<t, =H. 
The sum of the lengths of the line segments joining the points y(t;_,) and 
y(t), { = 1,2,...,m1s 
(6.7) > [(cos t; — cos t,_1)? + (sin 4, — sin t,_,)?]*. 
i=1 


By the Mean Value Theorem, there are ¢; and t; between t,;_, and ¢; such that 


cos t; — coSt;_, = —sin t,t, — t,-1), 
sin t; — sin t;_, = cos tj(t, — t-1). 
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Therefore, the sum (6.7) is 
> [(sin £7)? + (cos t7)?\(4, — 4-1). 
i=1 


Since sine and cosine are continuous, hence uniformly continuous on (0, f], 
and since (sin ¢)? + (cos ¢)? = 1, itis not hard to show that as the maximum 


length |t; — t,-,| > 0, (6.7) approaches ¢t. { 


This theorem shows that sine, cosine, and 7 as defined above do indeed 
have the usual interpretation. Next we consider them as functions from 
C to C. 


Theorem 6.4. The sine, cosine, and exponential functions have the following 
properties: 


(a) exp (iz) = cosz + isin z, all zeEC, 

(b) sin (z + 27) = sin z, cos (z + 27) = cos z, exp (z + 27i) = exp z, 
allzeC, 

(c) if weEC and w # 0, there is azeEC€ such that w = exp(z). If also 
w = exp (z’), then there is an integer n such that z’ = z + 2nzi. 


Proof. The identity (a) follows from solving (6.5) and (6.6) for exp (iz). 
By Theorem 2.2 and the definition of z, 


exp (Q7i) = cos 27 + isin 27 = 1. 
Then since exp (z + w) = exp Z exp w we get 
exp (z + 2mi) = exp z. 


This identity and (6.5), (6.6) imply the rest of (b). 
Suppose we C, w # 0. Letr = |w}. If x, y are real, 


|exp (iv)|? = |cos y + isin y|? = (cos y)? + (sin y)? = 1. 
Therefore 
lexp (x + iy)|? = |exp x exp (iy)| = |exp x] = exp x. 


To have exp (x + iy) = w, then, we must have exp x = r. By Theorem 5.4 
there is a unique such x € R. We also want exp (iy) = r~1w = a + Di. Since 
{r-+w| = 1, a? + b? = 1. By Theorem 6.3 there is a unique y € [0, 27) such 
that cosy =a, siny = b. Then exp(iy) = cosy + isiny =a-+ ib. We 
have shown that there are x, ye R such that if z = x + iy, 


exp Z = exp x exp (iy) = r-r-' w= w. 
Suppose z’ = x’ + iy’, x’, y’ real, and exp z’ = w. Then 

r = |w| = [exp z’| = |exp x’| = exp x’, 
x’ = x. There is an integer m such that 


2nz < y’ — y < Inv + 2z, 
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or 
y=yt+2nn+h, he [0,2z). 


Since exp z’ = exp Z, we have 
exp (iy) = exp (iy’) = exp ((y + 2nm7 + A)) = exp (iy + ih), 
SO 
1 = exp (—iy) exp (iy + ih) = exp (ih). 
Since h e€ [0, 277), this implies h = 0. Thus y’ = y + 2n7. 


The trigonometric functions tangent, secant, etc., are defined for complex 
values by 


tan z = sin z/cos z, zeEC, cos z # 0, 


etc. 
If w,z eC and w = exp z, then z is said to be a logarithm of w, 


z = logw. 


Theorem 6.4 shows that any w ¥ 0 has a logarithm; in fact it has infinitely 
many, whose imaginary parts differ by integral multiples of 27. Thus log w is 
not a function of w, in the usual sense. It can be considered a function by 
restricting the range of values of the imaginary part. For example, if w 4 0 
the z such that exp z = w, Im ze [a, a + 27) is unique, for any given choice 
ofaeR. 

If x > 0, it is customary to take for log x the unique real y such that 
exp y = x. Thus as a function from (0, oo) to R, the logarithm is the inverse 
of the exponential function. Theorem 3.7 shows that it is differentiable, with 


d d -1 
7 108 x) = (5 e*|y=0es) = e7108* = xo}, 
Thus 


(6.8) logx =| td, x>0. 


0 


Exercises 


1. Prove the identities 


sin (z + w) = sinzcos w + cos z sin w, 
cos (z + w) = coszcos w — sinz sin w 


for all complex z, w. (Hint: use (6.5) and (6.6).) 
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2. Show that tan x is a strictly increasing function from (—47, $7) onto 
R. Show that the inverse function tan~* x satisfies 


£ (tan-* x) = (1 + x’)? 


3. Show that i (1 + x*)"tdx = 7. 


4. Show that 
log(l +x) = | (1 + t)7* dt, —-l<x<o., 
0 
5. Show that 
oe Gat) eee 
log(l +x) = 2, x" l<x<l. 


(Hint: use Exercise 4.) 


§7. Functions of two variables 


Suppose A is an open subset of R?, i.e., for each (Xo, Yo) € A there is an 
open disc with center (Xo, yo) contained in A: 


A > {(x, y) | (« — Xo)? + (Y — Yo)? < 77}, some r > 0. 


In particular, A contains (x, yo) for each x in the open interval x) — r < 
x < X) + r, and A contains (Xo, y) for each y in the open interval yp — r < 


y<jyotr. 
Suppose f: A —> C. It makes sense to ask whether f(x, yo) is differentiable 
as a function of x at xo. If so, we denote the derivative by 


Di f(Xo; Yo) = lim [f(x, Yo) — I (os Yo) (x —- Xo)7*. 
x7*Xo 
Other common notations are 


of of 
Ox (Xo; Yo)s Ox (xqsVQ) FXos Yo)s D,f(Xo: Yo). 


Similarly, if f(xo, y) is differentiable at yy as a function of y we set 


Def (Xo, Yo) = lim Lf(x0, ¥) — F(X, Yo (y — Yo)~*- 


The derivatives D,f, D,f are called the first order partial derivatives of f. 
The second order partial derivatives are the first order derivatives of the first 


order derivatives: 


Df = D,(D,f), Df = D (Def), 
D, Def = D, (Def ); DD, f = D(Dif ). 
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Other notations are 


nn: i 
Ox?” Oy?” Ox dy ay Ox’ 


etc. 


Higher order partial derivatives are defined similarly. An (n + 1)-order 
partial derivative of fis D,g or D.g, where g is an n-order partial derivative 
of f. The function f: A > C is said to be n-times continuously differentiable, 
or of class C” if all the partial derivatives of f of order <n exist and are con- 
tinuous at each point of A. If this is true for every integer , then fis said to 
be infinitely differentiable, or of class C®. 


Theorem 7.1. (Equality of mixed partial derivatives). If f: A—C is of 
class C?, then D,D.f = D.D,f. 


Proof. Suppose (a, b)€ A. Choose r > 0 so small that A contains the 
closed square with center (a, b), edges parallel to the coordinate axes, and 
sides of length 2r. Thus (x, y)¢ A if 


|x-—al <r and |y—d| <r. 


In this square we apply the fundamental theorem of calculus to fas a function 
of x with y fixed, and conclude 


f(%, y) = | ° D, f(s, y) ds + f(a, y). 


Let g(y) = f(a, y). We claim that 


(7.1) Daf(x,y) =| DaDsfls,y)ds + 29). 


If so, then differentiation with respect to x shows D,D.f = D2D,f. To prove 
(7.1) we consider 


fos y + 6) — £09) — | ” DaDi f(s, y) ds — g'(y) 


=| beD.fs, 9 +.) — Difls,»)) — DaDifls, Yd 
+ [e~*(g(y + 2) — g(y)) -— 8’(y)]. 


The second term in brackets —> 0 as « + 0. If fis real-valued, we may apply 
the Mean Value Theorem to the first term and conclude that for each s and y 
and for each small e, there is a point y’ = y’(s, y, «) between y and y + e such 
that 


(7.2) e~*(D, f(s, y + e) ~ Di f(s, y)) ~ D2D, f(s, y) 

= DD, f(s, y’) — D2D, f(s, y). 
Now |y’ — y| < e, so |(s, y’) — (s, y)| < «. Since D.D,f is uniformly con- 
tinuous on the square |x — a] < r,|y — b| <r, it follows that the maximum 
value of (7.2) converges to zero as e — 0. This implies convergence to zero of 
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the integral of (7.1) with respect to s, proving (7.1) when / is real. In the 
general case, we look at the real and imaginary parts of fseparately. [ 


Remarks. In the course of proving Theorem 7.1 we have, in effect, 
proved the following. If fis a complex-valued function of class C+ defined on 
a rectangle |x — a| < r,,|y — b| < rg, then the derivative with respect to 
y of 


[ fe rdas 
a 
1s 
| Dof(s, y) ds. 
Similarly, the derivative with respect to x of 
y 
| fx, t) dt 
b 
1S 
y 
| D, f(x, t) dt. 
b 


We need one more result of this sort: if a = b, r, = s,, then 


y 
Fy) =| fisy)ds 
is defined for | y — a| < r,. The derivative is 


(7.3) | " Dafls, y) ds + f09). 


In fact 


Fy +) - FO) =| Wer +9-fsylds4 ec y) dy. 


Divide by e and let e > 0. By the argument above, the first integral converges 
to 


) ; Dof(s, y) ds. 


In the second integral, we are integrating a function whose values are very 
close to f(y, y), over an interval of length «. Then, dividing by «, we see that 
the limit is f(y, y). 

We need two results on change of order of integration. 


Theorem 7.2. Suppose f is a continuous complex valued function on the 
rectangle 


A={(x,y)|asx<bcsy< dq. 
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Then the functions 


as) =| ford, hy) =| fls,») as 


are continuous, and 
b d 
[ eax =| A) ay. 


Proof. The preceding remarks show that g and / are not only continuous 
but differentiable. More generally, 


{. S(s, t) dt, [ I(s, t) ds 


are continuous functions of s and of ¢ respectively. Define 


Fey) = | " { | “f(s, 1) ar} ds, 


F(x, y) = [ {{ f t) as} dt. 


We want to show that F,(b, d) = F,(b, d). The remarks preceding this theo- 
rem show that 


DF (x, y) = [1 y)ds = D2F.(x, y). 


Therefore, F, — F, is constant along each vertical line segment in the 
rectangle A. Similarly, D,F, = D,F,, so F, — F, is constant along each 
horizontal line segment. Since F(a, c) = F,(a,c) = 0, Fi = Fy. O 


The next theorem describes the analogous situation for integration over 
a triangle. 


Theorem 7.3. Suppose f is a continuous complex-valued function defined 
on the triangle 


A={(x,y)|0<x<a,0< yp < x}. 


| {[ 4 (x, y) ay} dx = [. {[ T(x, y) ax dy. 


Proof. Consider the two functions of t,0 < t < a, defined by 


f { f (x, y) ay} dx, [ { [ T(x, y) ax dy. 


By the remarks following Theorem 7.1, the derivatives of these functions with 
respect to ¢ are 


Then 


[sara f Mey) dy + | 0) dy 
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Thus these functions differ by a constant. Since both are zero when t = 0, 
they are identical. {] 


Finally we need to discuss polar coordinates. If (x, y) € R? and (x, y) # 

(0, 0), let 
r= (x? + y?)i2, 
Then 
(r-*x)? + (r-*y)? = 1, 
so there is a unique 6, 0 < @ < 27, such that cos 0 = r~*x, sin @ = r7}y, 
This means 
x =rcos 8, y=rsin@ 


r= (x? 4+ y?)?/2, 6 = tan~1(y/x). 


Thus any point p of the plane other than the origin is determined uniquely 
either by its Cartesian coordinates (x, y) or by its polar coordinates r, 0. A 
function defined on a subset of R? can be expressed either as f(x, y) or 
g(r, 0). These are related by 


(7.4) F(x, y) = g(x? + y*)', tan~* (y/x), 
(7.5) g(r, 6) = f(rcos 6, r sin 6). 


Theorem 7.4, Suppose f is a continuous complex-valued function defined 
on the disc 


Dp = {(x, y) | x? + y? < R}. 


Suppose g is related to f by (7.5). Then 


R (R2 — y2)1/2 R Qn 
| { | f(y) ax dy = { | g(r, 6)r ao} dr. 
-R — (R2 ~ y2)1/2 0 0 


Proof. Look first at the quadrant x > 0, y = 0. For a fixed y > O, if 
x = 0 then x = (r? — y?)??, Proposition 3.9 on coordinate changes gives 


(R2 — y2)1/2 R 
} I(x, y) dx = { f(r? — y?)", yr? — y?)~*r dr. 
0 r 


We integrate the integral on the right over O < y < R, and use Theorem 7.3 
to get 


Let y=rsin 0,6 € [0,47]. Then (r? — y?)? = rcos 6. We may apply 
Proposition 3.9 to the preceding integral of g-r over 0 < 6 < 4m. Similar 
arguments apply to the other three quadrants. [ 
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Exercises 


1. Suppose A < R? is open and suppose g: A —> C and h: A > C are of 
class C*?. Show that a necessary condition for the existence of f: A —> C such 
that 


(*) Dif = g, Def = h 


iS that Deg = D,)h. 
2. In Exercise 1, suppose A isa disc {(x, y) | (x — Xo)? + (y — yo)? < R?}. 
Show that the condition Dog = D,h is sufficient. (Hint: consider 


fe, = | als») ds + | hex, 1) dt.) 


§8. Some infinitely differentiable functions 


In §4 it was shown that any power series with a positive radius of con- 
vergence defines an infinitely differentiable function where it converges: 


fos) = ay = 2) 
We know 
An = (n!)~*f (Xo). 


In particular, if all derivatives of f are zero at Xo, then f is identically zero. 
There are infinitely differentiable functions which do not have this property. 


Proposition 8.1. There is an infinitely differentiable function f: R->R 
such that 


f(x)=0, x <0, 
f(x) > 0, x > 0. 


Proof. We define f by 


f(x) =0, x<0 
f(x) = exp(—I1/x), x>0. 


Near any point x # 0, fis the composition of two infinitely differentiable 
functions. Repeated use of the chain rule shows that fis, therefore, infinitely 
differentiable except possibly at zero. 

Let us show that fis continuous at 0. If y > 0, then 


co 6] 


ev = > (n!)~1y" < (m!)7ty", m=0,1,.... 


n=0 
Thus if x > 0, 
0 < f(x) = exp (—1/x) = exp (1/x)71 < m!(1/x)~™ = m! x", 
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m =0,1,.... In particular, f(x) > 0 as x 0. 
It is easy to show by induction that for x > 0, 


FOX) = pix") f(X), 


where p,(x) is a polynomial of degree < k + 1. Of course, this equation also 
holds for x < 0. Suppose we have shown that f™ exists and is continuous at 
0; then of course f(0) = 0. We have 


(8.1) LPF) — FOO) x3] = [FO%)x74| 
= |x "p(x *)f()]- 


Since p;, is of order <k + 1 and 
[f@)| < K& + 3)lxrko8, 


it follows that the right side of (8.1) converges to zero at x +0. Thus f+ (0) 
exists and is zero. Similarly, f%*?(x) = p,41(x7) f(x) > 0 as x +0, so 
ft” is continuous. [ 


Note that all derivatives of the preceding function vanish at zero, but fis 
not identically zero. Therefore f does not have a convergent power series 
expansion around zero. 


Corollary 8.2. Suppose a < b. There is an infinitely differentiable function 
g: R—R such that 


g(x) =0, x€(a,d), 
g(x) > 0, x € (a, b). 
Proof. Let /f be the function in Theorem 8.1 and let 


g(x) = f(x — afb — x). 
This is positive where x — a > 0 and where b — x < 0, and is zero else- 
where. It is clearly of class C”. QJ 


Corollary 8.3. Suppose a < b. There is an infinitely differentiable function 
h: R—R such that 
h(x) = 0, x <a, 
0 < A(x) < 1, 0<x <b, 
hx)=1, x2b. 


Proof. Let g be the function in Corollary 8.2 and let 
na) =e g(t) de 


0 


where c > 0 is chosen so that h(b) = 1. Then h’ = cg > 0, h is constant 
outside (a, b), etc. Q 


Chapter 3 


Periodic Functions and Periodic Distributions 


§1. Continuous periodic functions 


Suppose wu is a complex-valued function defined on the real line R. The 
function u is said to be periodic with period a # 0 if 
u(x + a) = u(x) 
for each x ER. If this is so, then also 
u(x + 2a) = u(x + a) +: a) = Ux + a) = Uy), 
u(x — a) = u((x — a) +. a) = u(x). 


Thus wu is also periodic with period 2a and with period —a. More generally, 
u is periodic with period na for each integer n. If u is periodic with period 
a ~ 0, then the function v, 


v(x) = u(|a|x/27) 


is periodic with period 27. It is convenient to choose a fixed period for our 
study of periodic functions, and the period 27 is particularly convenient. 
From now on the statement ‘“‘u is periodic” will mean “‘w is periodic with 
period 27.” In this section we are concerned with continuous periodic 
functions. We denote the set of all continuous periodic functions from R 
to C by @. This set includes, in particular, the functions 


sin nx, COS nx, exp (inx) = cos nx + isin nx 


for each integer 7. 
The set @ can be considered a vector space in a natural way. We define the 
operations of addition and scalar multiplication by 


(1) (u + v)(x) = u(x) + v(x), u, veg, xeER; 
(2) (au)(x) = au(x), UuEeGg, aeC, xeER. 


it is easily checked that the functions u + v and au are periodic. By Proposi- 
tion 1.1 of Chapter 2, they are also continuous. Thus u + ve@, aue@. The 
axioms V1-V8 for a vector space are easily verified. We note also that there 
is a natural multiplication of elements of &, 


(uv)(x) = u(x)v(x), u, vEE, xeER. 


The set @ may also be considered as a metric space. Since the interval 
[0, 27] is a compact set in R and since u € @ is continuous, 


sup |u(x)| < oo. 
xe[0,27] 
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We define the norm of u € @ to be the real number |u| where 
(3) lu| = sup |u(x)| = sup |u(x)]. 

xeR xe[0,27] 


The norm (3) has the following properties: 


(4) lu| > 0, and |u| =O only if u(x) = 0, all x; 
(5) lau| = jal |ul, aeC, uwe@; 
(6) ju + v| < ful + fol, u,vE®. 


The properties (4) and (5) are easily checked. As for (6), suppose x € R. Then 
\(u + v)(x)| = u(x) + v%)| < [u(x)| + [o@)| s [ul + |e}. 


Since this is true for every x € R, (6) is true. 
To make @ a metric space, we set 


(7) d(u, v) = |u — vj. 


Theorem 1.1. The set @ of continuous periodic functions is a vector 
space with the operations defined by (1) and (2). The set @ is also a metric 
space with respect to the metric d defined by (7), and it is complete. 


Proof. As we noted above, checking that @ satisfies the axioms for a 
vector space is straightforward. The axioms for a metric space are also 
easily checked, using (4), (5), and (6). For example, 


l(u—v)+(v-—w) 


d(u, w) = |u — w| = 
< |u — v| + |v — w| = dtu, v) + div, w). 


Finally, suppose (u,,)7_ 1; is a Cauchy sequence of functions in @. By Theorem 
4.1 of Chapter 2, there is a continuous function u: R->C such that 
|u, — u| +0. Clearly u is periodic, soue @ and @ is complete. {J 


Sets which are simultaneously vector spaces and metric spaces of this 
sort are common enough and important enough to have been named and 
Studied in the abstract. Suppose X is a real or complex vector space. A norm 
on X is a function assigning to each u € X a real number |ul, such that 


ju) > 0, and |u|] = 0 implies u = 0; 
au] = [al lu], a@scalar, ue X; 
lu+yv| < ful + |v, u,veX. 


A normed linear space is a vector space X together with a norm |u|. Associated 
to the norm is the metric 


d(u, v) = |u — yj. 


If the normed linear space is complete with respect to this metric, it is said 
to be a Banach space. 

In this terminology, Theorem 1.1 has a very brief statement: @ is a 
Banach space. 
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Suppose X is a complex normed linear space. A linear functional 
F: X — C is said to be bounded if there is a constant c > 0 such that 


|F(@u)| <clu|, allueX. 


Proposition 1.2. A linear functional F on a normed linear space X is 
continuous if and only if it is bounded. 


Proof. Suppose F is bounded. Then 
|F(u) — F(y)| = |F(u — v)| < clu — yI, 

so |F(u) — F(v)| < cif |u — vj < cute. 

Conversely, suppose F is continuous. There is a 6 > 0 such that 
|u| = |u — O| < 6 implies 

|F@)| = |F(u) — F@)| <1. 
For any u 4 0, uc X, the vector v = 6|u|~?u has norm 56. Therefore 
|F(u)| = [F(8~*uly)| = 6~* ul |F(W)| < 3-“lul, 

and Fis bounded. [J 


It is important both in theory and practice to determine all the con- 
tinuous linear functionals on a given space of functions. The reason is that 
many problems, in theory and in practice, can be interpreted as problems 
about existence or uniqueness of linear functionals satisfying given con- 
ditions. The examples below show that it is not obvious that there is any 
way to give a unified description of all the continuous linear functionals on 
@. In fact one can give such a description (in terms of Riemann-Stieltjes 
integrals, or integrals with respect to a bounded Borel measure), but we shall 
not do this here. Instead we introduce a second useful space of periodic 
functions and determine the continuous linear functionals on this second 
space. 


Exercises 


1. Suppose (a,)?_--« iS a (two sided) sequence of complex numbers 
such that 


fe 0] 


> |a,| < 00; 


n=— 


here we take the infinite sum to be 


Show that the function u defined by 


u(x) = S a, eXp (inx) 


n=—~ © 
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is continuous and periodic 
2. Suppose u: R—>C is a continuous function and suppose there is a 
constant M such that u(x) = 0 if |x| = M. Show that for any x the series 


oo 


(x)= > u(x + 2nz) 
converges. Show that the function v is in @. 
3. If ue @, define the real number |u|’ by 


jul’ = 2m)-*| |u| a 


Show that |u|’ is a norm on @ and that |u|’ < |ul. 

4. Suppose d’ is the metric associated with the norm |u|’ in Exercise 3. 
Show that @ is not complete with respect to this metric. (Hint: take a sequence 
of functions (u,)”-, of functions in @ such that 


0 < u,(x) < 1, xeR,n = 1,2,..., 
u,(x) = 0, x € [0, 7/2 — 1/n] U [32/2 + 1/n, 27], 
u,(x) = 1, x € [2/2, 37/2]. 


Then |u, — Un|’>0 as n,m—oo. If ue, there is an open interval 
(7/2 — 8, 7/2 + 8) on which either |u(x)| > 4 or |u(x) — 1| > 4. Show that 
|u, — ul’ > 8/6m for large values of n.) 

5. Which of the following are bounded linear functionals on @, with 
respect to the norm |u|? 


(a) F(u) = u(n/2), 
(b) F(u) = I °7 sin nx u(x) dx, 


(c) F(u) = [5” (u(x)? dx, 
(d) F(u) = 17u(0) + J?" u(x) dx. 
(e) F(u) = —3|u(0)|. 


6. Suppose X is a normed linear space. Let X’ be the set of all bounded 
linear functionals on X. Then X’ is a vector space. For F € X’, let 


|F| = sup {|F(w)| | we X, Ju] < 1}. 
Show that |F| is a norm on X’. Show that for any ue X and Fe X’, 
[F(u)| < |F| |ul. 


Show that X’ is a Banach space with respect to this norm. 


§2. Smooth periodic functions 


Suppose u: R->C is a continuous periodic function, and suppose that 


the derivative 
Du(x) = u'(x) 
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exists for each x € R. Then Du is also periodic: 


Du(x + 27) = lim h-1[u(x + 22 + h) — u(x + 27)] 
= lim h-*[u(x + h) — u(x)] = Du(x). 


In particular, if u is infinitely differentiable and periodic, then each deriv- 
ative Du, D7u,..., D*u,... isin @. 

We shall denote by F the subset of @ which consists of all functions 
ué@ which are smooth, i.e., infinitely differentiable. Such a function will be 
called a smooth periodic function. If uis in Z then the derivatives Du, D?u,. . . 
are alsoinA 

The set F is a subspace of @ in the sense of vector spaces, so it is itself a 
vector space. The function |sin x| is in @ but not in FZ so 7 # ©. We could 
consider FY as a metric space with respect to the metric on @ given in the 
previous section, but we shall see later that F is not complete with respect 
to that metric. To be able to consider F as a complete space we shall intro- 
duce a new notion of convergence for functions in Z 

A sequence of functions (u,)r., © F is said to converge to ue F in the 
sense of Ff if for each k = 0,1, 2,..., 


|D*u, — D'ul| +0 as ho. 
(Here D°u = u.) We denote this by 
u, —>u (P). 
Thus (u,)7., converges to uw in the sense of FY if and only if each derivative 
of u, converges uniformly to the corresponding derivative of u as n —> oo. 

A sequence of functions (u,)r., 1s said to be a Cauchy sequence in the 
sense of Ff if for each k = 0,1,..., (D*u,)?_, is a Cauchy sequence in @. 
Thus 

| D*u, — D*u,|>O as n,m—>oo 
for each k. 

When there is no danger of confusion we shall speak simply of “‘con- 
vergence”’ and of a “Cauchy sequence,” without referring to the “the sense 
of A” The statement of the following theorem is to be understood in this 
way. 

Theorem 2.1. The set FP of all smooth periodic functions is a vector space. 
If (u,)-1 < F is a Cauchy sequence, then it converges to a functionue FZ 


Proof. As noted above, F is a subspace of the vector space @: ifu, ve P 
then u -+ ve FY aue Z& Thus Ais a vector space. 


Suppose (u,)°., is a Cauchy sequence. For each k the sequence of 
derivatives (D*u,)?., is a Cauchy sequence in @. Therefore it converges 
uniformly to a function v,¢€%@. For k = 0, 1, 2,..., 


D¥ug(x) = D*u,(0) + | D¥+4y,(t) dt. 
0 
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By Theorem 4.2 of Chapter 2, 


v(x) = lim D*u,(x) = lim D*u,(0) + lim | D¥*1y,(t) dt 


04(0) + | de ealt) a 

0 
Therefore Dv, = v,41, all kK. This means that if u = vo, then v, = D©u and 
| D*u, — D*u| > 0 as n—> oo. Thus u, —> u (in the sense of F). [ 


The remainder of this section is not necessary for the subsequent develop- 
ment. We show that there is no way of choosing a norm on F so that con- 
vergence as defined above is equivalent to convergence in the sense of the 
metric associated with the norm. However, there is a way of choosing a 
metric on F (not associated with a norm) such that convergence in the sense 
of F is equivalent to convergence in the sense of the metric. Finally, we 
introduce the abstract concept which is related to F in the way that the 
concept of “‘ Banach space”’ is related to @. 

Suppose there were a norm |u|’ on # such that a sequence (u,)r-1 © F 
converges in the sense of 7 to ue F if and only if 


lu, — ul’ —>0. 
Then there would be a constant M and an integer N such that 
(1) |u|’ < M(\u| + |Dul +---+ |[D%u|), allucF 


In fact, suppose (1) is false for every M, N. Then for each integer n there 
would be a u, € F such that 


|up|’ > n(|un| + |Du,| +--++ |D*u,|). 

Let 
Vz = (|Un|’)~7*Un. 
Then 
| D¥v,| = (lun|')~*|D*u,| <n-* = ifn > k, 

so v, —> 0 in the sense of Z But |v,|’ = 1, all n. This shows that the norm |u|’ 
must satisfy (1) for some M, N. Now let 

w(x) = n-*~* sin nx, n=1,2,.... 


Then w, € A and 


IA 
= 


|D'w,| = n*-*-2 <n7?, k 
Thus by (1), 
|wal’ — 0, 


But |D"++w,| = 1, all n, so (w,)¢-1 does not converge to 0 in the sense of P. 
This contradicts our assumption about the norm |u|’. 
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Although we cannot choose a metric on ¥ associated with a norm, which 
gives the right notion of convergence, we can choose a metric as follows. Let 


d'(u,v) = > 2-*-3| Dey — D*o|[1 + |D'u — D*o|]-3. 
k=0 
The term of this sum indexed by k is non-negative and is smaller than 
2~*~-?, Thus 
d'(u, v) < I, u,vEeF. 
It is clear that 


d'(u, v) = 0, d'(u,v) = 0 implies u = »v, 
d'(u, v) = d'(v, u). 


The triangle inequality is a little more difficult. Let 
d(u, v) = |u — vj, d*(u, v) = d(u, v)[1 + d(u, v)]71. 
The reader may verify that 
d*(u, w) < d*(u, v) + d*(v, w). 
Then 


d’(u, w) 


> 2-*-1d*(D'u, D'w) 
k=0 


IA 


-++< d'(u,v) + d'(v, w). 


Theorem 2.2. A sequence of functions (U,)r=1 < F is a Cauchy sequence 
in the sense of FP if and only if it is a Cauchy sequence in the sense of the metric 
d'. Thus (P, d’) is a complete metric space. 


Proof. Suppose (u,)r=1 1s a Cauchy sequence in the sense of Z Suppose 
e > 0 is given. Choose k so large that 2~* < «. Choose N so large that if 
m> Nandn => N, then 


|D’u, — Diu,| <4te, jf =0,1,...,k. 
Then if m,n > N, 


"(Uns Un) = D271 1d*(D'iq, Diy) 
j=0 


k 
< > 2-7 Ge) + DS 2-771 
j=0 j=kt+1 


<te+2-*-1 < de + te. 


Conversely, suppose (u,,)7_ 1 is a Cauchy sequence in the sense of the metric d’. 
Given an integer k > O and an e > 0, choose WN so large that if m,n > N 
then 


d' (tims Up) < 27*~2e, 
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For m,n => N, 


|D¥u, — D*u,| = 2**2-2-*-1.2-1/D'y, — D*up| 
< 2k+2.9--1.g*(D*y, D*tun) 
< 2**2d"(un, Um) < €. 


Thus (u,).1 is a Cauchy sequence in the sense of Z 
The same argument shows that d'(u,, u) > 0 if and only if u,—>u (PY). 
Thus (7, d') is complete. 


There is an important generalization of the concept of a Banach space, 
which includes spaces like & Let X be a vector space over the real or complex 
numbers. A seminorm on X is a function u — |u| from X to R such that 


lu] > 0, |au| = {al jul, ju + vj < ju] + [vj. 


(Thus a seminorm is a norm if and only if |u| = 0 implies u = 0.) Suppose 
there is given a sequence of seminorms on X, |uJ,, |u|.,..., with the property 
that 


(2) lu], = 0, all k implies u = 0. 
Then we may define a metric on X by 
d'(u, vy) = pe" — vi,{1 + Ju — vj,J7?. 
If X is complete with respect to the metric d’, it is said to be a Frechet space. 
Note that 
d'(u,,V¥)>0O as n->© 
is equivalent to 
lu, — Vj, 70 as n->oo, forall k. 
In particular, if we take X = Y and 
luli, = |D*~*ul, 


then d’ agrees with d’ as defined above. Thus Theorems 3.1 and 3.2 say that 
P is a Frechet space. 


Exercises 


1. Which of the following are Cauchy sequences in the sense of F? 
u,(x) = n~* cos nx, 
v,(x) = (n!)~? sin nx, 


n 


W(x) = > (m!)~+ sin mx. 


m=1 
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2. Suppose X is a complex vector space with a sequence of seminorms 
[uj,, julo,..., satisfying (2). Let d’ be the associated metric. Show that a 
linear functional F: X — C is continuous if and only if there are a constant 
M and an integer N such that 


|F(u)| < M(juj, + jul2 +---+ july),  allueX. 


§3. Translation, convolution, and approximation 


The aim of this section and the next is to show that the space Y of smooth 
periodic functions is dense in the space @ of continuous periodic functions; 
in other words, any continuous periodic function u is the uniform limit of a 
sequence (u,)r_, of smooth periodic functions. Even more important than 
this theorem is the method of proof, because we develop a systematic 
procedure for approximating functions by smooth functions. 

The idea behind this procedure is that an average of translates of a 
function u is smoother than uw itself, while if the translated functions are 
translated only slightly, the resulting functions are close to wu. To illustrate 
this the reader is invited to graph the following functions from R to R: 


u,(x) = |x|, 
ua(x) = $x — el + Hx + el, 
Ug(x) = 4|x — e| + 4|x| + 4]x + el, 


where « > 0. 
If ue @ and te R, the translation of u by t is the function T,u, 


Tiu(x) = u(x — ft), xeER. 


Then T,ue@. The graph of 7, is the graph of u shifted ¢ units to the right 
(i.e., shifted |t| units to the left, if t < 0). In these terms the functions above 
are 


un = 47T,u, + 4T_ my, ug = 47,u, + 47T,u, + 47 _ uy. 
More generally, one could consider weighted averages of the form 
(1) W = Qol,,uU + aT,,u +--+4+ a,T,u, 
where 

a, > 0, Ao ta,t+-:-t+a,=I1, 
and most of the ¢, are near 0. If 
O0O<t<t, <::-<t, =27 
and we set 
(ty) = Ate — te-1)~* 


then (1) becomes 


(2) w= > d(H MTi,UMte — te-1): 
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The natural continuous analog of (2) is the symbolic integral 
y= | ” A(t) Tu dt, 

defining the function 

(3) v(x) = \ ” p(t)uce — 1) dt. 


We wish to study integrals of the form (3). If u, ve @, the convolution of 
u and v is the function u * v defined by 


| 22 
(4) (u# of(x) = | wx — y)o(y) dy. 
0 
It follows readily from (4) that 
l 2n 
(5) Jus eo] < |uls-[  |o(a)] de < ul [o} 
0 


Proposition 3.1. fu, ve@, thenu*veéE@. Moreover 


(6) uU*xv = vU*U, 

(7) (au) * v = a(u* v), aeC, 

(8) (ut+tv)*w=uxwt+oxw, weG, 
(9) (uxv)*w=ux(v*w), 

(10) Tu * v) = (Ty) *v = ux (Ty). 


Proof. We begin with part of (10). 


(I) Tis o)(x) = (uw vee — 9) = = | ue — 1 = yoy) dy 


= 5-| Tax — yoy) dy = (Tu) #0. 


Therefore, 
(12) Tu xv) — ux o| = |(Tu — v) #0] 


< [Tu — u| |v}, 


where we have assumed (7) and (8). Now w is uniformly continuous on [0, 27] 
and is periodic; it follows easily that u is uniformly continuous on R. There- 
fore |T,u — u| >0 as t->0. Then (12) implies continuity of u * v. Also, 


Ton(u * v) = (Tz,t) * 0 = ux B, 


SO uU* U IS periodic. 

The equality of (6) follows from a change of variables in (4): let y’ = 
x — y, and use the periodicity of u and v. Equalities (7), (8), and (9) are easy 
computations. The last part of (10) follows from (11) and (6). 4 
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Note that 
t- [u(t + x) — u(x)] = tA _ wu — uj(x). 
Lemma 3.2. [fue then 
j¢-(T_u — u) — Du| +0 as t—-0. 
Proof. By the Mean Value Theorem, 
t~*[T_wu — u}(x) — Du(x) = Du(y) — Du(x) 


where y = y(t, x) lies between x and x + t. Since Duis uniformly continuous, 
t-1(T_,u — u) converges uniformly to Duast—>0. 


Corollary 3.3. Jfue then 
t-(T_u—u)—>u (PY) 
as t 0. 
Proof. It is easy to see that 
D*(T_u) = T,(D*u). 
Then 
D*[t-(T_wu — u)] = t7(7_,D*u — D*u), 
which converges uniformly to D(D*u) = D*(Du). [ 
Proposition 3.4. JfuePandve@ thenuxveEePF and 


(13) D*(u x v) = (D*u) « v, all k. 
Proof. By Proposition 3.1, 
(14) t-7[T_(ux*v) — (ux*v)] = [t-(T_wu — u)] * v. 


By Lemma 3.2 and (5), the expression on the right in (14) converges uni- 
formly to (Du) « v as t-> 0. Thus 


D(u * v) = (Du) * v, 
and u *« v has a continuous derivative. But, Due & so we also have 
D*(u * v) = D((Du) * v) = (D2u) « v. 
By induction, (13) holds for allk. Q 


Corellary 3.5. Suppose (v,)f < €, vEe@, and |v, — v| 0. Then for 
eachueF 


U*V,—>uU*xv (PA). 
Proof. For each k, 


D*uxv, — uxv) = D'ux(v, — v). 
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It follows from this and (5) that 
| D¥(u*x v, — u*v)| +0, 


allk. Q 


Having established the general properties of convolution, let us return 
to the question of approximation. Suppose ue @. If (y,)f-1 is a sequence of 
functions in Z then each function 

Un = Pn *U 
is smooth. Thinking of u, as a weighted average of translates of u, we can 
expect u, to be close to u if g, has average value 1 and is concentrated near 
0 and 27 (as a function on [0, 27]). 
A sequence (9,)r=1 © @ is said to be an approximate identity if 
(i) p(x) = 0, all n, x; 
(ii) 1/2m >" p(x) dx = 1, all n; 
(iii) for each 0 < 6 < a, 
21=~ 6 
| p,(x)dx-—>0O as n-—->0o. 
J6 

Theorem 3.6. Suppose (9,)£ < @ is an approximate identity. Then for 

each ue G, 

lp,*u—ul| +0 as n->o. 
Moreover, ifueéF then 

G,*uU—>u (Pf) as n—->©O. 


Proof. Since (27)~+ I ; “,(y) dy = land o, *u = u* g,, we have 


aml(@, * u)(x) — u(x) [ ule — y)pa(y) dy — u(x) [ “eal y) dy 


| [u(x — y) — u(x)]p.(y) “| 


6 2n—6 2 
silt i+ 

0 6 2n=-6 

re) 22 226d 
< sup Tau = ul( | Pn +{ rs) + 2lul | Pn 
|s]<26 0 2n-6 v6 
227-6 

< 


sup |7,u — ul + 2|u| Pn 

js s 26 é 

Given e > 0, we first choose 6 > 0 so small that 6 < a and 
|Tu—u| <awe if |s| < 5. 


Then choose N so large that 


22-6 
2Iul | Q,<27e, non. 
é 
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Ifn > N, then 
lpn *u— ul <e., 
This proves the first assertion. Now suppose u € & For each k, 
D"(p, * U) = D*(u * py) = (D*u) * py = Pn * (D*u), 
which converges uniformly to D*y. Thus 
Gn ¥U—>U (FP). O 


In the next section we shall construct a sequence in F which is an approxi- 
mate identity. It will follow, using Proposition 3.4 and Theorem 3.6, that P 
is dense in @. 


Exercises 


1. Let e,(x) = exp (ikx), k = 0, +1, +2,.... These functions are in 7 
(see §6 of Chapter 2). Suppose ue @. Show that 


Cy ¥U = ACK, 


where 


—_ I a -—iky 
a =32| e~ u(y) dy. 


2. Show that e, *e_, = l,ande,*e_, =Oif7#k. 


§4. The Weierstrass approximation theorems 


A trigonometric polynomial is a function of the form 


(1) ox) = > a,exp (ikx), 


k=-n 


where the coefficients a, are in C. The reason for the terminology is that for 
k > 0, 


exp (+ikx) = [exp (+ix)]* = (cos x + isin x)*. 


Therefore any function of the form (1) can be written as a polynomial in the 
trigonometric functions cos x and sin x. Conversely, recall that 


cos x = 4[exp (ix) + exp (—ix)], 
sin x = 5; [exp (ix) — exp (—ix)]. 


Therefore any polynomial in cos x and sin x can be written in the form (1). 
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Lemma 4.1. There is a sequence (9,)? of trigonometric polynomials which 
is an approximate identity. 


Proof. We want to choose a non-negative trigonometric polynomial » 
such that 


(0) = 92m) = 1, 
g(x) < 1 for 0< x < 2z. 


Then successive powers of ¢ will take values at points near 0 and 27 which 
are relatively much greater than those taken at points between 0 and 27. 
We may take 


g(x) = 4(1 + cos x) 
and set 
9, (x) = c,(1 + cos x)” 


where c, is chosen so that 


20 
| 9, (x) dx = 27. 
0 


We need to show that for each 0 < 6 < 7a, 


22-6 

(2) | 9,(x)dx+0 as n—>0 
rs) 

There is a number r, 0 < r < 1, such that 

(3) 1 + cosx < r(1 + cos y) 

if 

(4) xe[d,27— 6], ye[0, 39). 


Then (3) and (4) imply 
p(x) = c,(1 + cos x)" < r",(y), 
SO 
1/26 
$6,(x) < "| g,(y) dy < 2ar", 
0 
or 
p(x) < 408-17", xe [5,27 — 8] 
Thus ¢, — 0 uniformly on [6, 27 — 6]. 


Lemma 4.2. If 9 is a trigonometric polynomial and ue @, then » * u is a 
trigonometric polynomial. 


Proof. This follows from Exercise 1 of the preceding section. [] 


Theorem 4.3. The trigonometric polynomials are dense in the space @ of 
continuous periodic functions, and in the space of smooth periodic functions. 
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That is, ifue @ and v & ©, there are sequences (u,)y and (v,)£ of trigonometric 
polynomials such that 


|u, — u| +0 
and 
Vz —>v (PY). 


Proof. Let (g,) be a sequence of trigonometric polynomials which is 
an approximate identity, as in Lemma 4.1. Let 


Un = Pn * U, Vn = Pn * UV. 


By Lemma 4.2, the functions u, and v, are trigonometric polynomials. By 
Theorem 3.6, u, — u uniformly and v, > vin the sense of AZ [ 


Note that if u, v are real-valued, then so are the sequences (u,)%°, (v,)2 
constructed here. 


Corollary 4.4. F is dense in ©. 


Theorem 4.3 is due to Weierstrass. There is a better-known approxima- 
tion theorem, also due to Weierstrass, which can be deduced from Theorem 
4.3. 


Theorem 4.5. (Weierstrass polynomial approximation theorem). Let u be 
a complex-valued continuous function defined on a closed interval [a, b] < R. 
Then there is a sequence (p,)¢ of polynomials which converges uniformly to u 
on the interval [a, b}. 


Proof. Suppose first that [a, b] = [0, 7]. We can extend wu so that it is a 
function in @; for example, let u(—x) = u(x), x € [0, 7] and take the unique 
periodic extension of this function. Then there is a sequence (u,)?° of trigono- 
metric polynomials converging uniformly to u. Now the partial sums of the 
power series 


> (m!)~+(ikx)™ = exp (ikx) 


converge to exp (ikx) uniformly on [0,7]. Therefore for each n, we may 
replace the functions exp (ikx) in the expression of the form (1) for u, by 
partial sums, so as to obtain a polynomial p, with 


| Pa(X) — u,(x)| < a7}, x € [0, 27]. 


Then p, — u uniformly on [0, 27]. 
In the case of an arbitrary interval [a, 5], let 


v(x) = u(a + (b — a)x/z), x € [0, ]. 


Then v is continuous on [0, z], so there is a sequence (q,)° of polynomials 
with g, —> v uniformly on [0, 7]. Let 


PAY) = Qnlw(y — ab -a)), yela, d]. 
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Then p,, is also a polynomial and p, — u uniformly on [a, 5]. 


Exercises 


1. Suppose u € @ and suppose that for each integer k, 
22 
| u(x) exp (ikx) dx = 0. 
0 


Show that u = 0. 
2. Suppose u: [a, b] —- @ is continuous, and for each integer n > 0, 


[ u(x)x" dx = 0. 


a 


Show that u = 0. 


§5. Periodic distributions 


In general, a “distribution” is a continuous linear functional on some 
space of functions. A periodic distribution is a continuous linear functional 
on the space A Thus a periodic distribution is a mapping F: A > C such 
that 


F(au) = aF(u), ae, ucF; 
F(u + v) = Flu) + Fv), u,vEeFP; 
F(u,) > F(u) ifu,—>u (PY). 


If v is a continuous periodic function defined on [0, 27], then we define a 
linear functional F = F, by 


(1) F(u) = ns i} " Wdu(x) dx,  ue@. 


Then F,: ¢ — C is linear, and 
| Foun) ~~ F(u)| S 2 [Un ~ ul. 


Therefore F,, is continuous on @. Its restriction to the subspace F is a periodic 
distribution. We say that a periodic distribution F is a function if there is a 
v €@ such that F = F,. If so, we may abuse notation and write F = v. 

Note that different functions v, we @ define different distributions. In 
fact, suppose F, = F,,. Choose (u,)? < FY such that u,—->w* — v* uni- 
formly, where w*(x) = w(x)*, the complex conjugate. Then 


0 = Inf Fuliy) — Foley) = f "(w(x) — 0(%))l_(x) dx > f " |w(x) — v(x)? de, 


SOW = Uv. 
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Not every periodic distribution is a function. For example, let 6: @ + C 
be defined by 


(2) 8(u) = u(0), ue. 


Then the restriction of 6 to F is a periodic distribution. It is called the 6- 
distribution, or Dirac 5-distribution. To see that it is not a function, let 
Un(x) = (4 + $cos x)". 


Then d(u,) = 1, all n. But u,(x)—0 uniformly for xe [e, 27 — e], any 
e > 0. Also 0 < u,(x) < 1, all x, n. It follows from this that for any v eG, 
F,(u,) > 0. Thus 6 # F,. 

The set of all periodic distributions is denoted by Y’. We consider 
as a vector space in the usual way: if F, GEA’, ue Bae, then 


(F + G)\(u) = F(u) + G(u), 
(aF)(u) = aF(u). 


Note that if v, w are continuous periodic functions, then 
Fy, + Fy, = Fyew, F., = aF,,. 
A sequence (F,)° © #' is said to converge to Fe ’ in the sense of F’ if 
F(u) > F(u), allueF 

We denote convergence in the sense of 7” by 

F,—>F (#), 
or simply by 

F,—-F 


when it is understood in what sense convergence is understood. 

We want to define operations of complex conjugation, reversal, transla- 
tion, and differentiation for periodic distributions. For any such operation 
there is a standard procedure for extending the operation from functions to 
distributions. For example, if ve C, the complex conjugate function v* is 
defined by 


v*(x) = v(x)*. 
Then 
Fy(u) = J } ” v(x)*u(x) dx = (x [° v(x)u*(x) dx)’ = (F,(u))*. 
27 Jo 27 Jo 
Then we define F* for an arbitrary Fe by 
(3) F*(u) = F(u*)*, ucZP 


Similarly, if v€ @ we define the reversed function 3 by 


i(x) = v(—x). 
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Then 
F,(u) = > | * (—x)ul(x) dx = - | ” (du(—x) de = F,(@). 


We define F~, Fe F, by 
(4) F~(u) = F(a), ucZF 
If ve and teER, recall that the translate T,v is defined by 
T,v(x) = v(x — ft). 
Then 


l 20 1 21 
wn | T;v(x)u(x) dx = a ) v(x — t)u(x) dx 
l 270 
_ | v(x)u(x + t) dx = F,(T_4). 
0 
We define T,F, Fe F’, by 
(5) T,F(u) = F(T_w), ucF 
IfvePAand uc FY, then integration by parts gives 


5 {- Dv(x)u(x) dx = a {- v(x) Du(x) dx = —F,(Du). 


We define DF, Fe F’, by 


(6) (DF)(u) = —F(Du), ucZ 
Then inductively, 
(7) (D*F)(u) = (—1)"F(D'u), ue F 


Each of the linear functionals so defined is a periodic distribution. For 
example, if u, > u (P) then Du, + Du (P). It follows that 


(DF )(un) = —F(Du,) > —F(Du) = DF(u), 


so the derivative DF is continuous. Similarly, F*, F, 7,F, and D*F are in #. 
In particular, let us take F = 6. Then 


(8) § = 8 =§ 
(9) T,5(u) = u(t), 
(10) D*8(u) = (—1)* D*u(0), ucP 


Proposition 5.1. The operations in #' defined by equations (3), (4), (5), 
and (6) are continuous, in the sense that if F, > F (#') then 


F,* > F* (FP), 
Fi” > F” (FP), 
TF, > TF (F), 
DF, —> DF (#’). 
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Proof. Each of these assertions follows trivially from the definitions. 
For example, if ue 7 then 


(DF,)(u) = —F,(Du) > —F(Du) = DF(u). 
Thus DF, —> DF (#), etc. 


Recall that if ue PY then Du is the limit of the “difference quotient” 
t-1(T_w — u). 


Proposition 5.2. If Fe #’, then 
(11) t-47_.F — F)>F (#7) 
as t—> 0. 
Proof. Suppose ué F By definition, 
(12) ¢°\(7_.F — F)u) = t-' F(T) — tt FW) = —F(t fu — Ty). 
Now 
(13) t~*[u — Tyul(x) = t~+[u(x) — u(x — £)]. 


An argument like that proving Lemma 3.2 and Corollary 3.3 shows that the 
expression in (13) converges to Du in the sense of FY as t-> 0. From this fact 
and (12) we get (11). @ 


As an example, 
t-*(T_,6 — 8)(u) = t7*[u(—t) — u(0)] ~ — Du(O) = (Ds)(u). 
The real and imaginary parts of a function v € @ can be defined by 
Rev = 4+(v + v*), 
Im v = 5, (0 — v*), 
Similarly, we define the real and imaginary parts of a periodic distribution F 
by 
Re F = 4(F + F*), 


i * 
Im F = 5 (F - FY), 


F is said to be real if F = F*. A function v € @ is said to be even if v(x) = 
v(—x), all x; it is said to be odd if v(x) = —v(—x), all x. These conditions 
may be written 


~w 


v= od, v= —0d. 
Similarly, we say a periodic distribution F is even if 
F=F'; 
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we say F is odd if 
F=—-F. 


Exercises 


1. Which of the following define periodic distributions? 


(a) F(u) = Du(1) — 3u(2z). 

(b) F(u) = J,* u(x)? dx. 

(c) F(u) = \." u(x) dx. 

(d) F(u) = [F u(x). + x)" dx. 

(e) F(u) = - \." D®u(x)|cos 2x| dx. 
(f) F(u) = 27-0 a;D’u(t,). 

(g) Fu) = 270 GU!) *D’u(0). 


2. Verify (8), (9), (10) 

3. Express the distributions in parts (a) and (f) of Exercise 1 in terms of the 
6-distribution and its translates and derivatives. 

4. Compute DF when F is the distribution in part (c) or (d) of Exercise 1. 

5. Show that Re F and Im F are real. Show that F = Re F + iImF. 

6. Show that F real and u real, ue Z imply F(u) is real. 

7. Show that F even and u odd, ue Z imply F(u) = 0. Show that F odd 
and u even, ue F imply F(u) = 0. 

8. Show that any Fe #’ can be written uniquely as F = G + H, where 
G, He F and G is even, Z is odd. 

9. Suppose that v € @ 1s differentiable at each point of R and Dv = w is 
in @. Show that 

D(F,) = Fy; 


in other words, if F = v, then DF = Dv. 

10. Suppose v is a continuous complex-valued function defined on the 
interval [0, 27] and that Dv = wis continuous on (0, 277) and bounded. Define 
Fi,éeF' by 


F(u) = = [" v(x)u(x) dx, ucZF 
Show that 
DF,(u) = (v(0) — v(27))u(O) + - i} ” w(x)u(x) dx. 


In other words, 
DF, = F,, + [v(0) — v(27)]6. 
11. Let v(x) = |sin 4x]. Compute 
(D*F,\(u), uEeF 
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§6. Determining the periodic distributions 


We know that any continuous periodic function v may be considered as a 
periodic distribution F,,. The derivatives D“F, are also periodic distributions, 
though in general they are not (defined by) functions. It is natural to ask 
whether all periodic distributions are of the form D*F,, v ¢ @. The answer is 
nearly yes. 


Theorem 6.1. Suppose F is a periodic distribution. Then there is an integer 
k = 0, a continuous periodic function v, and a constant function f, such that 


(1) Fe= D*F, + F;. 


The proof of this theorem will be given later in this section, after several 
other lemmas and theorems. First we need the notion of the order of a 
periodic distribution. A periodic distribution F is said to be of order k (k an 
integer > 0) if there is a constant c such that 


|F(@u)| < c{{u| + |Du] +---+ |D*u}}, allueF 


For example, 5 is of order 0. If ve @ then D*F, is of order k. It is true, but 
not obvious, that any Fe #’ is of order k for some integer k > 0. 


Theorem 6.2. Jf Fe #’, then there is an integer k = 0 such that F is of 
order k. 


Proof. If F is not of order k, there is a function u, € F such that 
|F(u,)| = K+ Wilma] + |Du,| +--+ + | D¥ uy}. 
Let 
v, = (kK + 17 {u,| + |Du,| +--+ + | D*u,|}- 2m, 


Then we have 


(2) |F(v,)| = 1, 
while 
(3) |v,,| + | Dv;,| + vee + | D¥v,| < (k + 1)7}, 


Suppose now that F were not of order k for any k = 0. Then we could find a 
sequence (v,,)¢-1 © F satisfying (2) and (3) for each k. But (3) implies 


Up, > 0 (P). 
Then (2) contradicts the continuity of F. Thus F must be of order k, some k. J 


Lemma 6.3. Suppose Fe #' is of order 0. Then there is a unique con- 
tinuous linear functional F,: @ —> © such that 


F(u) = F,(u), alueF 
Proof. By assumption there is a constant c such that 


|F(u)| <clu|, uweF 
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If ue @, there is a sequence (u,)r_-, © F such that u, > u uniformly. Then 
|F(un) — F(um)| < clu, — Um| > 0, 

so (F(u,))?-1 is a Cauchy sequence. Let 

(4) F,(u) = lim F(u,). 


We want to show that F,(u) is independent of the particular sequence used 
to approximate u. If (v,)? < A and v, > u uniformly, then 


lun — Vp] > 0 
SO 
|F(un) — FQn)| < ¢ltn — 0| > 0. 
Thus 
lim F(u,) = lim F(v,). 


The functional F,:@ —-@ defined by (4) is easily seen to be linear. It is 
continuous (= bounded), because 


|F(u)| = lim |F(u,)| < clim |u,| = clul. 


Conversely, suppose F,: @ — @ is continuous and suppose F,(u) = F(w), 
allueF For any ue@, let (u,)? < FY be such that u, — u uniformly. Then 


F.(u) = lim F,(u,) = lim F(u,) = F,(u). O 
(The remainder of this section is not needed subsequently.) 


Lemma 6.4. Suppose Fe #’ is of order 0, and suppose F(w) = 0 if w is 
a constant function. Then there is a function v € © such that 


D*F, = F. 


Proof. Let us suppose first that F = F;, where fe ©. We shall try to find 
a periodic function v such that D?v = f. Then we must have 


Dv(x) = Do(0) + } ft) dt =a + | f(t) dt, 
0 “0 
where a is to be chosen so that v is periodic. We may require v(0) = 0. Then 
x x t 
v(x) =| Dv(t) dt - | E + | f(s) as| dt 
0 0 0 
x pt 
= ax +| [ 1) ds dt. 
0 “0 
We use Theorem 7.3 of Chapter 2 to reverse the order of integration and get 


v(x) = ax + Moc — s) ds. 
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Let 
(x —s)* =0 if x < 5s, (x-—s)* =x-Ss ifx > s. 
Then 
Qn 
(5) v(x) = ax + f(s)(x — s)* ds. 


By assumption on f, 
20 
i} bf(s) ds = 0, beC. 
6) 


Now we want to choose a in (5) so that v is periodic. This will be true if 
v(2z7r) = 0, 1.e., 


0 = 27a + " (27 — s) f(s) ds = 27a — [ sis) ds. 


Thus 
a= ~ {- sf(s) ds 
and 
(6) ox) = slsytxs + n(x — 8)*] ds, 


Now suppose only that Fe # is of order 0 and that F(w) = 0 if w is 
constant. Let F, be the extension of F to a continuous linear functional on @. 
Let 


u,(s) = xs + 2n(x — s)*, 0O<s < 2z. 


Then u,(0) = 27x = u,(27). We can extend u, so that it is a continuous 
periodic function of s. Then (6) suggests that we define a function v by 


(7) u(x) = Fi(u;). 

We want to show that ve @ and D?F, = F. It is easy to check that 
|u,, — Uy| < 2m|x — yl, Ux son = Ux 

Therefore 


lo(x) — v(y)| < [Fi(ux) — Fiuy)| < clux — uy] < c2m|x — yI, 
v(x + 27) = F(x, +22) = F,(u,) — v(x). 


Thus ve @. Let us compute DF,. If we & then 


2nt 1[(T_.F, — F,](w) = rife [u(x + t) — v(x)]w(x) dx. 
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Approximate the integral by Riemann sums. These give expressions of the 
form 


(8) ¢-* > [v(x; + t) — v(2%j)]wOxs)(% — Xy-1) 
= Ff, (5 W(X) — Xy-a)t7*. [Maya — uel), 


since F, is linear. As partitions (Xo, X1,..., Xn) of (0, 27) are taken with 
smaller mesh, the functions on which F, acts in (8) converge uniformly to 
the function g;. Here 


ils) = i, t-[u,—(s) — ux(s)]w(x) dx. 


Now |t7*(u,-; — ux)| < 27. For fixed s€ (0, 27), and0 < x <5, 
t(uy4:—U,) >sS as t>0. 
This convergence is uniform for x in any closed subinterval of (0, s). Similarly, 
t Uns: — Ux) > S+ 27 as tO), 
uniformly for x in any closed subinterval of (s, 27). It follows that 


(9) 27 DF (w) = lim t-*[T_w — v\(w) = F,(g), 
where 


g(s) = s[ w(x) dx + an [ w(x) dx. 
Then 
2n(D?F,)(w) = —2n(DF,)(Dw) = —Fi(A), 
where 


h(s) = s| 


0 


2 2n 


Dw(x) dx + 27 Dw(x) dx = 4rw(27) — 27w(s). 
Since F, applied to a constant function gives zero, we have 
2n(D?F,)\(w) = —F,(A) = 27F,(w) = 27F(w). 
Thus D?F, = F. Q 


Lemma 6.5. Suppose Fe F' and suppose F(w) = 0 if w is a constant 
function. Then there is a unique GE #' such that DG = F and G(w) = 0 if 
w is a constant function. If F is of order k > 1, then G is of order k — 1. 


Proof. Ifue&Z it is not necessarily the derivative of a periodic function. 
We can get a periodic function by setting 


Su(x) = | ” u(t) dt — ea | ” U(t) dt = | ” u(t) dt — xF.(u), 
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where 
e(x) = 1, all x, 
Then 
D(Su) = u — FAu)e. 
It follows that if DG = F and G(e) = 0, then 
(10) Gu) = Gu — F.(uje) = G(D(Su)) = — DG(Su) = —F(Su). 


Thus G is unique. To prove existence, we use (10) to define G. Since S: P7 > F 
is linear, G is linear. Also 


|Su| < 4z|ul, 
| D(Su)| < 2|ul, 
|D*(Su)| = |D*-4uj, kk > 2. 


Then if u, —u (#) we have 
G(u,) = —F(Su,) > —F(Su) = Gu). 
Thus Ge #. Also 
DG(u) = —G(Du) = F(S(Du)) = F(u). 
Finally, suppose F is of order k > 1. Then 


|G(u)| = |F(Su)| < cf{|Su] + |DSu| +---+ |D*Sul} 
< S5ace{|u| + |Dul +---+ |D*~*u}}, 


and Gis of orderk — 1. [ 
Corollary 6.6. If Ge and DG = 0, then G = F,, where f is constant. 
Proof. Again let e(x) = 1, all x. Let f = (27)~*G(e)e, and 
H=G —- F,. 


Then DH = 0 and H(e) = 0. By Lemma 6.5 (uniqueness), H = 0. Thus 
G = F;. [] 


Finally, we can prove Theorem 6.1. Suppose Fc #’. Take an integer k 
so large that F is of order k — 2 > 0. Again, let e(x) = 1, all x, f= 
(27) ~1F(e)e, and 


Fo =F — F,. 


Then Fy is of order k — 2 and F,(e) = 0. By repeated applications of Lemma 
6.5 we can find F,, Fo,..., F,-2€F so that 


DF, = Fy-45 Fe) = 0, 


and F; is of order k — 2 — j. Then F,,_2 is of order 0. By Lemma 6.4, there 
isave@ such that D?F, = F,,_.. Then 
D*F, = D*-*F,_. = D'-1F,._, = ete Fo 
= F — F,. al 
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Exercises 
1. To what extent are the functions v and f in Theorem 6.1 uniquely 
determined? 
2. Find ve @ such that D?F, = F, where 
F= 56 —T,6, 


1.€., 
F(u) = u(O) — u(x). 


3. Find ve @ and a constant function f such that 


) = DF, + F,. 


§7. Convolution of distributions 


Suppose ve @ and ue FY. The convolution v * u can be written as 


(0 « w(x) = ue dQ) =e ) * ule — y)o(y) dy 


l 2n 
— =| v(y)i(y — x) dy = FT,f); 
0 


here again a(x) = u(—x). Because of this it is natural to define the convolu- 
tion of a periodic distribution F and a smooth periodic function uw by the 
formula 


(1) (F * u)(x) = F(T,f). 


Proposition 7.1. Jf F is a periodic distribution and u is a smooth periodic 
function, then the function F * u defined by (1) is a smooth periodic function. 
Moreover, 


(2) (aF)*u = a(F xu) = F x (au), FeF  ucBael; 
(3) (F+ G)*xu= Feut+ Gey, FeF ucF; 

(4) Fe(ut+v) = Feut+ Fx, FeF  uveF; 
(5) T(F xu) = (7,F)*u= F(T), FeF,uceP; 

(6) D(F xu) = (DF)*xu=F*(Du), FeF,ucF 


Proof. The identities (2)—(5) follow from the definition (1) by elementary 
manipulations. For example, 


T(F * u(x) = (F*u)(x — t) = F(T,-:fi) 
= F(T_,T,di) = (T,F)(T.@) = (T.F) * u)). 
Also, 
F(T, -u) = F(T,(Tu)”) = (F * Tw))Q). 
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This proves (5). It follows that 
(F* u(x + 2m) = (F * (T2,u))(x) = (F * u)(x). 
We know that 
t*[T_.F — F] > DF (7) as t-0. 
Therefore 


t*[(F * u(x + t) — (F«u)(x)] = t-[T_(F * u(x) — (Fx u)(x)] 
t-*(7_.F — TF](T,.u) > DF(T,u) 

= ((DF) * u)(x). 
This shows that F* u is differentiable at each point x € R, with derivative 


(DF) * u(x). By induction, D*(F * u) = (D*F)*u. Thus FxucP Finally, 
using (5) again, 


t*[(F «u(x + t) — (F*u(x)] = F* [t-(7_y — w(x) 
= F(T,[t-*(T_u — u)\”) > F(T.(Du)*) 
= (F * (Du))(x). 


By induction, D*(F * u) = F* (D*u), all k. We leave the proofs of (2), (3), 
(4) as an exercise. [ 


As an example: 
(7) S*uU = U, (D*8)*u = D*u, 


In using (1) to define F « u, we departed from the procedure in §4, where 
operations on distributions were defined in terms of their actions on functions. 
Suppose ue @, ve FY weF Then 


(8) Fu) = Fye(w) = Qn)? I " [ * ux — yo(y)w(x) dy dx 


= (2n)2 {soy [ay ~ m0 axl dy 
= Fai * w). 


This suggests that we could have defined F « u as a distribution by letting it 
assign to we the number F(a w). We shall see that this distribution 
corresponds to the function defined by (1). 


Lemma 7.2. [fue@ and ve, then w = uv is the uniform limit of 
the functions w,,, where 


(9) Wyn = (27)-? > n~*v(2a7m/[n)Tonmsnll- 
m=1 
Proof. Let Xmn = 2mm/n. Then it is easy to see that 


Zw) — W609) = > [— foCnnduCx — Xan) ~ oC )uCoe — y)] ay 


Xm-i,n 
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Now u, v are uniformly continuous and over the range of integration of the 
m-th summand, 
ly - Xmn| < 2m/n. 


Therefore |w, — w]| >Oasn—-oo. J 


Corollary 7.3. If ueF and ve, then the functions w, given by (9) 
converge to w = u*v in the sense of P asn—>o. 


Proof. Since D*(T,,u) = T,{D*u), D*w,, is the corresponding sequence of 
functions for (D*u) *« w = D*(u* w). Therefore D*w, — D*w uniformly as 
n—->o,foreachk. [J 


Proposition 7.4. If Fe F' andu,veF then 
F;,(v) — F(a * v), 
where 
f=Fru. 


Proof. Let w = &i*v and let w, be the corresponding function defined 
by (9), with @ replacing u. Then w, > @*v (#), so 


F(a * v) = lim F(v,) 
But 


F(W,) = Qn)"® >. n-40(Qnm|n)F Loan 
= = > v(2am/n) f(2a7m/n) = 


— [ v(x) f(x) dx = F,(0). i 


We shall now define the convolution of two periodic distributions F, G by 
(10) (Fx Gu) = F(G~ *u), uceF 


If G = F,, f = F * v, then Proposition 3.4 shows that F; = F * G. In general, 
we must verify that (10) defines a periodic distribution. Clearly F* G:F’ +C 
is linear. If u, — u (PW) and G is of order k, then 


\(G~ * un)(x) — G * u(x)| 
= |(G™ * (u, — u))(x)| 
|G~(T(un _ u)~)| 
c{|T(u, ~ u)| + | DT(u, _ u)| Feet | D'T,(u,, _ u)|} 
c{ju, — ul + |D(u, — w| +---+ |D*u, — u)}}. 


Thus G~ *u,—>G™~ *u uniformly. Similarly, for each j, D(G~ *u,) = 
G~ « Dju, > D(G™~ * u) uniformly. Thus 


G~ *u,—>G~ «u (PY), 


I IA Il 
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SO 
(F « G)(u,) > (F * G)(u). 

This shows that F* Ge #’. As an example, 

(11) 6*F= Fed = F, 


In the course of showing that F * G is continuous, we have given an 
argument which proves the following. 


Lemma 7.5. [f Fe#?’, (u,)~? © ¥ and u,—u (A), then Feu,—> 
Feu (A). 


Corollary 7.6. Suppose (f,)? <— 4 (g,)? < F and set 
F, = Fy G,, = Fy. 


Suppose 
F,—F (#7) and G,>G (#?). 
Then 
F,*G>F*«G (#) 
and 


F«G,>F+*G (#). 
Proof. Suppose ue & Then 
(F, * G)(u) = F,(G~ * u) > F(G~ * u) = (F « G\u) 
Also, G,~ * u—->G™~ xu (P) so 
(F «x G,)(u) = F(G,~ * u) > F(G~ * u) = (F *« G)(u). O 


We can now prove approximation theorems for periodic distributions 
analogous to those for functions. 


Theorem 7.7. Suppose (y,)° | F is an approximate identity, and suppose 
Fe’, Let F, = F,,, where f, = F * o,. Then F, > F (7). 

In particular, there is a sequence (f,)? of trigonometric polynomials such 
that F,, > F (#"). 


Proof. We have, by Proposition 7.4, 
F,(u) = F(, * 4), ucZ 
But (¢,)x-1 1S also an approximate identity, so 
Pn ¥U—>U (A). 
Therefore 


F,(u) — F(u). 
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If (p,)@=1 is an approximate identity consisting of trigonometric poly- 
nomials, then the functions f, = F * 9, are also trigonometric polynomials. 
In fact, let 


e,(x) = exp (27ikx). 


Then 
(F * e,)(x) = F(Txe;). 
But 
(Txen)(y) = ex(x — y) = ex{x)en(y), 
SO 
(F « e,)(x) = F(ex)ex(x). 
Thus 


F(> Mx) = > a,,F'(e;,)ex 
is a trigonometric polynomial. {J 


Finally, we prove the analog of Proposition 7.1 and Proposition 3.1. 


Proposition 7.8. Suppose F, G, He #’,aeC. Then 


(12) FeG=GrF, 

(13) (aF) * G = a(F * G) = F * (aG), 
(14) (F+G)+H=FeH+ Ged, 

(15) (Fx G)* H = Fx(G* A), 

(16) T(F * G) = (TF) * G = Fx (7,6), 
(17) D*(F « G) = (D*F) * G = F * (D*6). 


Proof. All of these identities except (12) and (15) follow from the 
definitions by a sequence of elementary manipulations. As an example, we 
shall prove part of (16): 


[TF * G)\(u) = F* GT_w) = F(G” * T_w) 
= F((T_;G~) * u) 
= F((7,G)~ * u) = (F * T,G)u). 
Here we used the identity 
(18) (7,G)" = T_{G"). 
To prove (12) and (15) we use Theorem 7.7 and Corollary 7.6. First, suppose 
G=F,,g¢€B Take (f,)r © F such that 


F, = F,,>F (#’. 
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Let 

hn = Sn * 2) H,, = F,,,. 
It follows from (8), (10), and Corollary 7.6 that 

H, = F,* G>F*G (#). 
But 

hn = & * Sn» 

so also 

H, =G*F,—>G#F (#). 


Thus (12) is true when G = F,, g¢ F In the general case, take (g,)° © F 
so that 
G, = Fy, > G. 


Then, in the sense of 7, 
FeG=lmF*G, = limG,* F = Gx F. 
The proof of (15) is similar. In the first place, (15) is true when 
F = F,, G = F,, H=F,, 
since 
Fx«(G* A) = F* Fon = Fragen) = Foegyen = (F * G) * A. 


We then approximate an arbitrary F by F;,, and get (15) when G = F;, 
H = F,. Then approximate G, H successively to get (15) forall F, G, He F#. 
The rest of the proof is left as an exercise. 


Exercises 


1. Prove the identities (2), (3), (4). 

2. Prove the identities (7), (11). 

3. Prove the identities (13), (14), (16), (17), (18) directly from the de- 
finitions. 

4. Prove the identities in Exercise 3 by approximating the distributions 
F, G, H by smooth periodic functions. 


§8. Summary of operations on periodic distributions 


In this section we simply collect for reference the definitions and results 
concerning Y’. The space F is the set of infinitely differentiable periodic 
functions u: R—> C. We say 


u,—>u (PY) if |D*u, — D*ul|>0,  allk. 
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A periodic distribution is a mapping F: A — C with 


Flu + v) = F(u) + Fv), F(au) = aF(u), 
F(u,) > F(u) if u,ou (PY). 


If v € G, the space of continuous periodic functions, then F,, ¢ ’ is defined by 


F,(u) = 5 ) ” v(x)u(x) dx. 


The 6-distribution is defined by 
d(u) = u(0). 


The sum, scalar multiple, complex conjugate, reversal, and translation of 
distributions are defined by 


(F + G)(u) = F(u) + Gt), 
(aF)(u) = aF(u), 
F¥*(u) = (F(u*))* = (u*(x) = u(x)*), 
F°(u)= F@ = (ax) = u(—)), 
(T,F)(u) = F(T_ wu) (Tyu(x) = u(x — ¢)). 


Derivatives ate defined by 
(D*F)(u) = (—1)*F(D*u). 
We say 
F,7>F @) 


F,(u) > Flu), alluceF 
In particular, 


t4(7_,F — F) > DF as t—0 


Then 
6 = 6* = 6~ 
(T,8)(u) = u(t), 
(D*5)(u) = (—1)* D*u(0). 
If ve? then 
(F,)* = L yt, 
(F,)~ = F;, 
T(F,) = F,, where w = Ty. 
IfveS 


D*(F,) = F, where w= D*v, 
The convolution F * u is the function 


(F x u(x) = F(T,a), ucZf 
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Then Fx ue If veG@, then 
Fyiy*u=vx*u, 
If (9,)£ © Ais an approximate identity, then 
Fxo,—>F (FW). 
More precisely, 
F,,>F (#), where f, = F* o,. 
In particular, 
O*uU =U, ucZ 
The convolution F * G is the distribution 
(Fx G)(u) = F(G™ *u), ucF, 
In particular if ve F then 


Fx F,=F,, where f= Fv. 


Clearly 
6%*F= Fed = F, 
If 
Fi,—>F (#) 
then 


F,*G—>F*G (#). 
The convolution of distributions satisfies 


FeG=GeF, 
(aF) * G = a(F* G) = F * (aG), 
(F+G)*H=F*H + GH, 
(Fx G)« H = F*«(G* A), 
TF * G) = (7,F)* G = F * (7,6), 
D*(F « G) = (D‘F) * G = Fx (D'*G). 
A periodic distribution F is real if F = F*. Any Fe #’ can be written 
uniquely as 
F=G+4+ iH, G, H real. 
In fact 
G=ReF=4(F + F*), 


H= Im F = 5,(F — F*) 


A periodic distribution F is even if F = F~ and odd if F = —F”. Any 
Fe can be written uniquely as 


F=G+4+4H, G even, H odd. 
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In fact 
G=1(F + F’), 
H=43(F — F’). 


The 6-distribution is real and even. 
A periodic distribution is of order k if there is a constant c such that 


|F(u)| < c(jul + |Du| +---+ |D*ul), allueA 


Any Fe #' is of order k for some k. 
If ve @ and fis constant, 
DF, + F,eF. 
Conversely, if Fe # is of order k + 2, k > O, then there are v € @ and con- 


stant function f such that 


PF = D*F, + F;. 


Chapter 4 


Hilbert Spaces and Fourier Series 


§1. An inner product in @, and the space L? 


Suppose uw and v are in @, the space of continuous complex-valued periodic 
functions. The inner product of u and v is the number (u, v) defined by 


(1) (u, v) = - {- u(x)o(x)* de. 

It is easy to verify the following properties of the inner product: 

(2) (au, v) = a(u, v) = (u, a*v), 

(3) (u, + Ug, Vv) = (Uy, v) + (Ua, V), 

(4) (u, Vv; + Vg) = (u, v1) + (U, Vo), 

(5) (v, u) = (u, v)*, 

(6) (u, v) > 0, (u, u) = O only if u = 0. 
We define |u|] for ue @ by 

1) Jul = (wwe = (Ff nol? ax) 

2m Jo 

Lemma 1.1. Jfu,veE@, then 

(8) (u, v)| < |lull lo}. 


Proof. Ifv = 0 then 
(u, v) = (u, Ov) = O(u, v) = 0, 
and (8) is true. Suppose v # 0. Note that for any complex number a, 
(9) O<(u—av,u — av) = (u,u) — (av, u) — (u, av) + (av, av) 
= |lul? — alu, v)* — a*(u, v) + al? lol]. 
Let 
a = (u, v)|v||~?. 
Then (9) becomes 
0 < |ul? — 2|(, v)/*Io]-? + [@, o)/? [oll -7, 

and this implies (8). Q 

The inequality (8) is known as the Schwarz inequality. Note that only the 


properties (2)-(6) were used in the proof, and no other features of the inner 
product (1). 
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Corollary 1.2. The function u —> ||u|| is a norm on @. 


Proof. Recall that this means that ||u|| satisfies 


(10) |u| > 0, iw} =O  onlyifu=0, 
(11) |u|] = fal lull, ae, 
(12) je + ol < [lull + fof. 


Property (10) follows from (6) and property (11) follows from (2). To prove 
(12), we take the square and use the Schwarz inequality: 


|u|]? + (u,v) + (v, uw) + oll? 


< ful? + 2ful) Jol + fol? 
= (lull + Jol)? ' 


ju + vj? =(utov,u + v) 


The new norm on @ is dominated by the preceding norm: 
(13) |u| < |u| = sup {|u(x)}}. 


It is important to note that @ is not complete with respect to the metric 
associated with this new norm. For example, let u,: R — R be the periodic 
function whose graph contains the line segments joining the pairs of points 


Then 


SO (u,)? is a Cauchy sequence in the new metric. However, there is no ue 
such that 
lu, — ul| +0. 


In order to get a complete space which contains @ with this inner product, 
we turn to the space of periodic distributions. Suppose (u,)° < @ is a 
Cauchy sequence with respect to the metric induced by the norm ||ul|, ice. 


un, — Uml| ~O as n,m-—>oo, 
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Let (F,,)? be the corresponding sequence of distributions: 
F, = F,,. 
Thus ifveF 


Flo) = 3 | uaGa)0G) ae = (a 


where again u* denotes the complex conjugate function. By the Schwarz 
inequality, 
|F.(v) — Fr(v)| = |(v, uf — um)| < lel] en — um 


< |v| |lu, — un. 
Therefore (F,(v))§ has a limit. We define 
(14) F(v) = lim F,(v). 


The functional F: A > @ defined by (14) is clearly linear, since each F, is 
linear. In fact, F is a periodic distribution. To see this, we take N so large 
that 
lun —Un|| <1 ifn,me Nn. 
Let 
M = max {|u,||, |uel],..., uy} + 1. 


Then for anyn < N, 


unl < M, 
while ifn > N, 
[unl] = [un — uy + Uy] < lun — ull + unl 
<1+ |uy| < M. 
Therefore 
|FA(v)| = |, ur)| < lle unl) < Mol, 
Xe) 


|F()| = lim |F,@)| < MoI. 
We have proved the following lemma. 


Lemma 1.2. Jf (u,)2 <@ is a Cauchy sequence with respect to the 
norm ||u||, then the corresponding sequence of distributions 


F, = F,, 
converges in the sense of F' to a distribution F, which is of order 0. 


It is important to know when two Cauchy sequences in @ give rise to the 
same distribution. 


Lemma 1.3. Suppose (u,)f | © and (v,)f < © are Cauchy sequences 
with respect to the norm |u|. Let F, = F,, and G, = F,, be the corresponding 
distributions, and let F, G be the limits: 


F,>F (#) and G,>G (P?. 
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Then F = Gif and only if 
||u, — Up|| > 0. 


Proof. Let w, = u, — v, and let H, = F, — G, = F,,. We want to 
show 


H,—>0 (F’) if and only if ||w,|| > 0. 


Suppose 
|w,|| —> 0. 
Then for any ue F 
|H,(u)| = |(u, wn)| < [lull || all > 0. 
Conversely, suppose 
(15) H,—>0 (FP). 


Given « > 0, take N so large that n,m > N implies 
(16) lw, ~~ Wl = (un ~~ Um) + (0, ~ Um) | sé. 
Fix, m > N. Then ifn > N we use (16) to get , 


! Wm ||? — (Wns Wm) = (Wns Wm — Wn) + (Wns Wr) 
_ (Wns Wn — Wn) + (Wm) 
< ela + |An(n)|- 


Letting n —> oo, from (15) we get 


|Wnll? < elm, 
or 
[Wall <e men. 0 


We define L? to be the set consisting of all periodic distributions F with 
the property that there is a sequence (u,)? such that 


lun — Un >O as n,m—>o, 
F,,2>F (PF). 


If (u,)? < @ is such a sequence, we say that it converges to F in the sense of 
L? and write 
u, > F (L?). 


Lemma 1.2 can be rephrased: if 
u,—>F (L*), v0, > G (L’), 
then 
F=G — if and onlyif |ju, — v,|| +0. 
Clearly L? is a subspace of # in the sense of vector spaces. In fact, if 


u,—>F (L?) and v,—->G (L?), 
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then 
au, — aF (L?), Un + UV, > F + G (L?). 
We may extend the inner product on @ to L? as follows. If 
u,—> F (L?), v0,» G (L?), 
let 
(17) (F, G) = lim (u,, v,). 


The existence of this limit is left as an exercise. Lemma 1.2 shows that the 
limit is independent of the particular sequences (u,)? and (v,)?. That is, 
if also 


u, > F (L?), v’ >G (L?) 
then 
(18) lim (uy, Vp) = lim (uj, v;). 


Theorem 1.3. The inner product in L? defined by (17) satisfies the identities 
(2), (3), (4), (5), (6). If we define 


(19) |F | = F)*?, 
then this is a norm on L?. The space L? is complete with respect to this norm. 


Proof. The fact that (2)-(6) hold is a consequence of (17) and (2)-(6) 
for functions. We also have the Schwarz inequality in L?: 


(A, G)| s [FI Gl. 


It follows that ||/F'|| is a norm. 


Finally, suppose (F,,) < L? is a Cauchy sequence with respect to this 
norm. First, note that if 


u, > F (L?) 
and ve @, and if we take v,, = »v, all n, then 
|F, — F |? = lim |v, — u,||? = lim |v — u,||?. 
It follows that 
Fu, — F\?+~0 as no, 


i.e., F can be approximated in L? by functions. Therefore, for each n = 
1,2,... we can find a function v, € @ such that 


|Fn — F,,|| < n7?. 
Then 


[Pn — Omll = Fog — Fomll < Fo, — Fall + Fa — Fall + [Fm — Fog 
< |F, — Fal +n7*+m-*+>0 as n,m—->oo., 
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Thus there is an F € L? such that 


v, > F (L?). 
But then 
[Fn —_ F| < Fn — F,,|| + |F., _ F| 
<nott |F., — F|| +0. 0 
Exercises 


1. Carry out the proof that @ is not complete with respect to the norm 
ul. 

2. Show that the limit in (17) exists. 

3. Show that (18) is true. 

4. Suppose /: [0, 277] — R is such that 


f(x)=1, xel[a, bd), 
f(x) =0, x¢[a, d). 
Define 


F(v) = - { foow(x) dx = = | “ux dx, bEP 


Show that Fe L?. 
5. Suppose /: [0, 27]-+ C is constant on each subinterval [x;-_,, x;), 
where 


O = X% < X1 <°++< xX, = 27. 
Define 


F(v) = ~ { " FS(x)v(x) dx. 


Show that Fe L?. 

6. Show that 4, the 6-distribution, is not in L?. 

7. Show that if FeL? there is a sequence (u,)? of smooth periodic 
functions such that 


u, —> F (L?). 
8. Let 7; denote translation. Show that if ue @ then 
Tu — Tul|>0O as t—s. 
If F € L?, show that 
|7,.F —7,F||+0O as t—s. 
9. For any Fe L?, show that 
Fl = sup {/F@)| | we F [ul] < 1. 
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§2. Hilbert space 


In this section we consider an abstract version of the space L? of §1. 
This clarifies the nature of certain theorems. In addition, the abstract version 
describes other spaces which are obtained in very different ways. 

Suppose H is a vector space over the real or complex numbers. An 
inner product in H is a function assigning to each ordered pair of elements 
u, ve Ha real or complex number denoted by (u, v), such that 

(au, Vv) = a(u, V), aa scalar, 
(u, + Ug, V) = (U;, v) + (Us, ¥), 
(v, u) = (u, v)* ((v, uw) = (u, v) in the real case), 
(u,u) > 0 ifu 4 0. 


The argument of Lemma 1.1 shows that 


(1) lu, v)| < lull Ivf, 
where 
(2) jul) = (a, u)”?. 


Then |u|] is a norm on H. If H is complete with respect to the metric associ- 
ated with this norm, then H 1s said to be a Hilbert space. In particular, L? 
is a Hilbert space. Clearly any Hilbert space is a Banach space. 

A more mundane example than L? is the finite-dimensional vector space 
C* of N-tuples of complex numbers, with 


N 
(a, b) = > abr 
n=1 
when 
a= (a, AQ, .+ +5 ay), b= (b,, be, ..., by). 
In this case the Schwarz inequality is 


N N 
< > lanl? >, |nl?. 
n=1 n=1 


N 


> a,,b* ° 


n=1 


Notice in particular that if we let 


a’ = ({a,|, |az|, a) lay), b’ = (|b,|, [bo], ne [by |), 
then 


(> lanl Jal) < > lanl? > [bal 


A still more mundane example is the plane R?, with 
a-b = a,b, + Aabe 


when a = (1, dz), b = (61, ba); here we use the dot to avoid confusing the 
inner product with ordered pairs. It is worth noting that the law of cosines 
of trigonometry can be written 


(2) a-b = |a| [b| cos @, 
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where 6 is the angle between the line segments from 0 to a and the line 


segment from 0 to b. Therefore |a-b| = |a| |b| if and only if the segments 
lie on the same line. Similarly, a-b = 0 if and only if the segments form a 
right angle. 


Elements u and v of a Hilbert space H are said to be orthogonal if the 
inner product (u, vy) is zero. In R? this means that the corresponding line 
segments are perpendicular. We write 


uly 


when u and v are orthogonal. More generally, u € H is said to be orthogonal 
to the subset S ¢ Hif 


uly, allve S. 
If so we write 
ul! S. 
If u | v then 


(3) ju + vi? = w+ v,u+ v) = (uu) + (u,v) + (V,) + (V, y) 
= |ul? + |v? 
In R?, this is essentially the Pythagorean theorem, and we shall give the 


identity (3) that name in any case. Another simple identity with a classical 
geometric interpretation is the parallelogram law: 


(4) Ju — vl? + Ja + vi? = 2]ul? + 2iIv/?. 


This follows immediately from the properties of the inner product. In R? 
it says that the sum of the squares of the lengths of the diagonals of the 
parallelogram with vertices 0, u, v, u + v is equal to the sum of the lengths 
of the squares of the (four) sides. 

When speaking of convergence in a Hilbert space, we shall always mean 
convergence with respect to the metric associated with the norm. Thus 


u,—u means |u, — ul >0. 
The Schwarz inequality shows that the inner product is a continuous function. 
Lemma 2.1. Jf (u,)?, (Va) < H and 
u, > U, Vv, —> VY, 
then 
(u,, Vn) — (U, ¥). 


Proof. Since the sequences converge, they are bounded. In particular, 
there is a constant M such that |v, || < M, all. Then 


|(Uns Vn) _ (u, y)| = |(u, — U, Vn) + (u, Vn — y)| 
< ju, — ull [vail + [all lv. — y| 


< M|u, — ull + |lull |lv, — vl 0. 0 
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Corollary 2.2. JfueH, S < H, andu | S, then u is orthogonal to the 
closure of S. 


The general theory of Hilbert space essentially rests on the following 
two geometric lemmas. 


Lemma 2.3. Suppose H, is a closed subspace of the Hilbert space H, 
and suppose u € H. Then there is a unique v € H, which is closest to u, in the 
sense that 


ju-—v| < ju-—wi, allweHy,. 
Proof. The set 
{lu — wi] | we H;} 


is bounded below by 0. Let d be the greatest lower bound of this set. For each 
integer 7 > O there is an element v, € H, such that 


ju—v,| <d+n-%. 


If we can show that (v,)? is a Cauchy sequence, then it has a limit ve H. 
Since H, is closed, we would have v € H, and |u — v|| = das desired. 

Geometrically the argument that (v,,)? is a Cauchy sequence is as follows. 
The midpoint 4(v, + v,,) of the line segment joining v,, and v,, has distance 
> d from u, by the definition of d. Therefore the square of the length of 
one diagonal of the parallelogram with vertices u, V,, Vm, Yn + Vm iS nearly 
equal to the sum of squares of the lengths of the sides. It follows that the 
length |v, — V,|| of the other diagonal is small. Algebraically, we use (4) 
to get 


0 < [Vn — Yl? = 2[[¥vn — ul? + 2[¥vn — wl? — [Va + Yn) — 20? 
< 2d + n-+)? + 2d + m-1)? — 4]4(v, + Vn) — ull? 
< 2(d + n~1)? + 2(d + m1)? — 4d? —0. 


To show uniqueness, suppose that v and w both are closest to u in the above 
sense. Then another application of the parallelogram law gives 
Ju — 3(v + w)|? = Flu — vi? + $iu — wi? — av — wl? 
= 2 — |v —w/? 

Since the left side is > d?, we must have v = w. [J 

As an example, take H = R?, H, a line through the origin. The unique 
point on this line closest to a given point u is obtained as the intersection of 
H, and the line through u perpendicular to H,. This connection between 


perpendicularity (orthogonality) and the closest point is also true in the 
general case. 


Lemma 2.4. Under the hypotheses of Lemma 2.3, the element v € H, is 
closest to u if and only if 


u—vli H;,. 
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Proof. First, suppose v € H, is closest to u, and suppose we H,. We 
want to show (u — v, w) = 0, and we may assume w # 0. Let u, = u — ¥. 
For any ae C, v + awe Hj. Therefore 


ju, — aw||? = lu — (v + aw) ||? > ju — vi? = Ju |?, 


or 
Ju, |? — (us, aw) — (aw, us) + (al? wi? > us|? 

Let 

(5) a = (u,, w)||w||~?. 


Then (5) becomes 
— |(u, w)/? [lw]? = 0. 


Thus (u,, w) = 0. 
Conversely, suppose u — v | H,, and suppose w € H,. Then vy — we H,, 
SO 


(u—v) | (v— Ww). 
The Pythagorean theorem gives 
ju — wl? = |@ — v) + ( — w)/? = Ju — vi? + |v — wi? 
> lu — v2. O 


Corollary 2.5. Suppose H, is a closed subspace of a Hilbert space H. 
Then either H, = H, or there is a nonzero element ue H such that u | H,. 


Proof. If H, # H, take u.¢H, uy ¢H,. Take v, € H, such that Vo 1S 
Closest to up. Then u = uy — Vo iS nonzero and orthogonal to H. [{ 


As a first application of these results, we determine all the bounded 
linear functionals on H. The following theorem is one of several results 
known as the Riesz Representation Theorem. 


Theorem 2.6. Suppose H is a Hilbert space and suppose ve H. The 
mapping L,: H — C (or R) defined by 


Lu) = (u,v), weH, 


is a bounded linear functional on H. Moreover, if L is any bounded linear 
functional on H, then there is a unique vy € H such that L = Ly. 


Proof. Clearly L, is linear. By the Schwarz inequality 
[Ly(u)| <= vil [ul 
Thus L, is bounded. 


Suppose L is a bounded linear functional on H. If L = 0 we may take 
v = 0. Otherwise, let 


H, = {uc H | Lu) = 0}. 
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Then H, is a subspace of H, since L is linear; H, is closed, since L is con- 
tinuous. Since L # 0, H, is not H. Take a nonzero ueé H which is orthog- 
onal to H,, and let 


v = |ul~*Z@u. 
Then also v | H,, so 
L(w) = L(w), we H,. 
Moreover, 
Ly(u) = |u| -?Z(u)(u, u) = L(u). 
If w is any element of H, 
w — L(u)~‘L(w)u ec Hy. 
Thus any element of H is of the form 
au + W; 


for some ae C (or R) and w, € Hj. It follows that Ly = L. 
To show uniqueness, suppose L, = L,. Then 


0 = Lv — w) — La(v — w) = |v — w|]?. O 


Exercises 


1. Prove (2). 


2. Suppose Fe #’. Show that F ¢ L? if and only if there is a constant c 
such that 
|F(@u)| < clu, allueF 


3. Let H be any Hilbert space and let H, be a closed subspace of H. Let 
H, = {uc H|u | H;}. 


Show that H, is a closed subspace of H. Show that for any ue H there are 
unique vectors u, € H, and u, € Hg such that 


u = U, + Ub. 


§3. Hilbert spaces of sequences 


In this section we consider two infinite dimensional analogs of the finite 
dimensional complex Hilbert space C”. Recall that if 


x = (a;, Qo,.. ., ay) € C* 
then 


Ixl? = > Jal? 


n=1 
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Let /,.2 denote the set of all sequences 


x = (a,) <C 


such that 
(1) > lan |? < 00, 
n=1 
If 
x = (a,)? el,?, y = (b,)2 e1,?, 
we set 
(2) (x, y) = > a,Dx, 


1 


n 


provided this series converges. 


Theorem 3.1. The space 1,? of complex sequences satisfying (1) is a 
Hilbert space with respect to the inner product (2). 


Proof. Suppose 
x=(a,)p €/,7 and y= (6,)P el,” 
If ae C, then clearly 
ax = (aa,)? € 1,2. 
As for x + y, we have 
>, an + bal? < D (lanl? + 2[anda| + [Bnl”) 
< 2> |an|? + 2> |b, |? < 0. 


Thus /,? is a vector space. To show that the inner product (2) is defined for 
all x, ye /,7, we use the inequality (1) from §2. For each N, 


> lavbtl < (> lanl?) (> (640) 
<(> lal) (> tr) 


1/2 


1/2 


Therefore 
[o 0) 
> land*| < x 
n=1 


and (2) converges. It is easy to check that (x, y) has the properties of an inner 
product. The only remaining question is whether /,? is complete. 
Suppose 


_— 2 _ 
Xm = (Gnn)na1 © ly ? m= 1,2,.... 
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Suppose (x,,)~ 1S a Cauchy sequence in the metric corresponding to the 
norm 


|x|] = (x, x)”. 
For each fixed n, 
l@m,n — Ap,n|? < ||Xm — Xp? > 0 
as m, p —> 00. Thus (a,,,)=1 is a Cauchy sequence in C, and 
Anna, as m—->O, 


Let x = (a,)?. We want to show that xe/,? and |x,, — x|| +0. Since 
(x,,)7 18 a Cauchy sequence, it is bounded: 


|xXnl| < K, all m. 


Therefore for any N, 


N N 
> lanl? = lim > lamin? < K?. 
n=1 MAO n=] 
Finally, given « > 0 choose M so large that m, p => M implies 
Xm — Xp] < e. 
Then for any N and any m => M, 


N N 
> lQm.n — 4n|? = lim > lam.n — Qp.n|? < &. 
n=1 p>O n=1 


Thus 
[Xn —xX| <e¢ ifm2> M. O 

It is often convenient to work with sequences indexed by the integers, 
rather than by the positive integers; such sequences are called two-sided 
sequences. We use the notation 

X = (G,)e wo = (..., Ag, A-1, Ap, Ay, Ag,...). 
Let /? denote the space of two-sided sequences 
Xx =(a,)2. <C 

such that 


ic.¢) 


(3) > lanl? < 0. 


n=—-@ 
Here a two-sided infinite sum is defined to be the limit 
oO N 
c, = lim Cc 
2. . we ey " 


if this limit exists. 
If 


X= (a)20o C12, yu=(b)2o. <2, 
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we let 


(4) «y= > abt, 


n=-@ 


provided the series converges. 


Theorem 3.2. The space 1? of two-sided complex sequences satisfying (3) is 
a Hilbert space with respect to the inner product (A). 


The proof of this theorem is very similar to the proof of Theorem 3.1. 


Exercises 


1. Prove Theorem 3.2 
2. Let e,, € 1,7 be the sequence 


en = (Gnn)n=1 . 
with an., = Oif m 4 n, ay, = 1. Show that 


(a) |lenl| = 1; 

(b) e,, | e, if m # p; 

(c) (x, e,) 0 as m > ©, for each x € /,?; 

(d) the set of linear combinations of the elements e,, is dense in /,?; 
(e) if x e/,? there is a unique sequence (b,) © C such that 


N 

x — > b,en 
n=1 

3. Show that the unit ball in /,?, the set 

B = {xel,?| |x] < 0, 


—>Q0 as N->oo. 


is closed and bounded, but not compact. 
4. Show that the set 


C = {xel,?|x = (a,)f, each |a,| < n-4 


is compact; C is called the Hilbert cube. 


$4, Orthonormal bases 


The Hilbert space C” is a finite dimensional vector space. Therefore any 
element of C% can be written uniquely as a linear combination of a given 
set of basis vectors. It follows that the inner product of two elements of C% 
can be computed if we know the expression of each element as such a linear 
combination. Conversely, the inner product makes possible a very convenient 
way of expressing a given vector as a linear combination of basis vectors. 
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Specifically, let e, e C*” be the N-tuple 
e, = (0,0,...,0,1,0,..., 0), 


where the | is in the n-th place. Then {e,, e2,..., ey} is a basis for C”. More- 
over it is clear that 


(1) (e,,¢,) = 1 ifn=m, (e,,€n) = 0 ifnAm. 


If x = (a, dg,..., Ay) € C% then the expression for x as a linear combination 
of the basis vectors e,, is 


N 
(2) x = > An€n: 
n=1 
Because of (1), 
(x, Cn) = An. 


Thus we may rewrite (2) as 


(3) x = (x, e,)e,. 


Maz 


n=1 


If x = (a1, do,..., ay) and y = (b,, bg, ..., by), then 


N 
(x, y) = > andr. 
n=1 


Using (3) and the corresponding expression for y, we have 


(4) (%¥) = 2% edly en)* =D, (% n)(Cn» ¥), 
In particular, 
(5) Ixi? = > lel? 


The aim of this section and the next is to carry this development over 
to a class of Hilbert spaces which are not finite dimensional. We look for 
infinite subsets (e,)° with the properties (1), and try to write elements as 
convergent infinite sums analogous to (3). 

A subset S of a Hilbert space H is said to be orthonormal if each ue S 
has norm 1, while 


(u,v) = 0 ifu, ve S,u # v. 


The following procedure for producing orthonormal sets is called the Gram- 
Schmidt method. 


Lemma 4.1. Suppose {u,, Us, ...} is a finite or infinite set of elements of 
a Hilbert space H. Then there is a finite or infinite set S = {e;, €o,...} of 
elements of H such that S is orthonormal and such that each u, is in the sub- 
space spanned by {e,, €2,..., €n}: 
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(if S has m elements, m < oo, we interpret the statement as saying that 
u, € span {e;,..., e,} when n > m.) 


Proof. The proof is by induction. If each u; = 0, we may take S to be 
the empty set. Otherwise let v, be the first nonzero u,, and let 


e, = |v.) ~*v3. 


Then {e,} is orthonormal and wu, € span {e,}. Suppose we have chosen 
e,,...,@, such that {e,,...,e,} is orthonormal and w,,...,u,¢€ 
span {e,,...,@n}. If each u,; € span {e,,...,e,,} we may stop. Otherwise 
choose the first 7 such that u, is not in this subspace. Let 


N 
Vnt+1 — Uy — > (u,, e,). 
n=1 


Since {e,,..., €,} iS orthonormal, it follows that 


(WYn+i€n) =90, IIlene<m. 
Since u, ¢ span {e,,..., Cn}, Yn+1 # 0. Let 
Cm+1 = Vn +1 | ~>Vin +1: 
Then {e,,...,@n41} iS orthonormal and u,,,, is in the span. Continuing, 


we get the desired set S. {J 


Note that completeness of H was not used. Thus Lemma 4.1 is valid in 
any space with an inner product. 

An orthonormal basis for a Hilbert space H is an orthonormal set S <¢ H 
such that span (S) is dense in H. This means that for any ue H and any 
e > O, there is a v, which is a linear combination of elements of S, such that 
ju — vi <e. 

A Hilbert space H is said to be separable if there is a sequence (u,)? © H 
which is dense in H. This means that for any ue H and any «e > 0, there is 
an n such that |u — u,| < e. 


Theorem 4.2. Suppose H is a separable Hilbert space. Then H has an 
orthonormal basis S, which is finite or countable. 

Conversely, if H is a Hilbert space which has a finite or countable ortho- 
normal basis, then H is separable. 


Proof. Suppose (u,)? is dense in H. By Lemma 4.1, there is a finite or 
countable orthonormal set S = {e,, e.,...} such that each u, is a linear 
combination of elements of S. Thus S is an orthonormal basis. 

Conversely, suppose S is a finite or countable orthonormal basis for H. 
Suppose H is a complex vector space. Let T be the set of all elements of H 


of the form 
N 
u= > ay 
n=l 
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where WN is arbitrary, the e, are in S, and the a, are complex numbers whose 
real and imaginary parts are rational. It is not difficult to show that T is 
countable, so the elements of 7 may be arranged in a sequence (u,)?. Any 
complex number is the limit of a sequence of complex numbers with rational 
real and imaginary parts. It follows that any linear combination of elements 
of S is a limit of a sequence of elements in 7. Since S is assumed to be an 
orthonormal basis, this implies that (v,)° is dense in H. 


To complete Theorem 4.2, we want to know whether two orthonormal 
bases in a separable Hilbert space have the same number of elements. 


Theorem 4.3. Suppose H is a separable Hilbert space. If dimH = 
N < ©, then any orthonormal basis for H is a basis for H as a vector space, 
and therefore has N elements. 

If H is not finite dimensional, then any orthonormal basis for H is countable. 


Proof. Suppose dim H = N < o, and suppose S ¢ H is an ortho- 
normal basis. If e,,..., e,, are distinct elements of S and 


N 
> a,e, = 0, 
n=1 


0 = (> a€n, €n) = Ams m= 1,...,M. 


Thus the elements of S are linearly independent, so S has < N elements. 
Let S = {e1, €o,..., €y}. We want to show that S is a basis. Let H, be the 
subspace spanned by S. Given u € H, let 


then 


M 
u = > (U,e,)en. 
n=1 


Then 
(u — u,,e,,) = 0, m= 1,...,M. 


It follows that u — u, is orthogonal to the subspace H,. The argument used 
to prove Lemma 2.4 shows that u, 1s the element of H, closest to u. But by 
assumption on S, there are elements of H, arbitrarily close to u. Therefore 
u = u, € H,, and S is a basis. 

The argument just given shows that if H has a finite orthonormal basis S, 
then S is a basis in the vector space sense. Therefore if H is not finite di- 
mensional, any orthonormal basis is infinite. We want to show, therefore, 
that if H is separable and not finite dimensional then any orthonormal basis 
is at most countable. Let (u,)? be dense in H, and let S be an orthonormal 
basis. For each element e € S, there is an integer n = n(e) such that 


le — u,|| < 2777, 
Suppose e, fe S and e # f. Let nm = n(e), p = n(f). Then 


le — £]? = lle? + ©, f) + @e) + [fll? 
1+0+0+1=2, 
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SO 
lu, — u,| = ju, -et+te—f+f-4,| 
> |e —f| — lu, --e +f -u,| 
> lle — fl| — lu, — ell — [ff - u,| 


> 21/2 —_ 2-1/2 _ 2-1/2 — 0 


Thus u, 4 u,. We have shown that n(e) 4 n(f) if e 4 f, so the mapping 
e —> n(e) is a 1-1 function from S to a subset of the integers. It follows that S 
is finite or countable. { 


Exercises 


1. Let X, be the vector space of continuous complex-valued functions 
defined on the interval [—1, 1]. If u, ve X,, let an inner product (u, v), be 
defined by 


1 


(u,v), = | ; u(x)v(x)* dx. 


Let u,(x) = x*-1, n= 1,2,.... Carry out the Gram-Schmidt process of 
Lemma 4.1 to find polynomials p,, n = 1, 2, 3, 4 such that p, is of degree 
n — 1, p, has real coefficients, the leading coefficient is positive, and 

(Pn Pm) =1 ifn= mM, (Pr Pm)1 =0 ifnAm. 


These are the first four Legendre polynomials. 
2. Let X, be the set of all continuous functions u: R— C such that 


| |u(x)|2e-* dx < 00, 


Show that X, is a vector space. If u, v € X2, show that the integral 


oo 


(u, v)o = [ u(x)u(x)e-* dx 


exists as an improper integral, and that this defines an inner product on Xg. 
Show that there are polynomials p,, n = 1, 2, 3,... such that p, is of degree 
n — 1 and 
(Pan Pn2=1 ifn=m, (Pa Pm)l2a=90 ifn Am. 

Determine the first few polynomials of such a sequence. Except for constant 
factors these are the Laguerre polynomials. 

3. Show that there is a sequence (p,)¢ of polynomials such that p, is 
of degree n — 1 and 


| | Px(x)|2e7 2/2? dx = 1, 


oO 


| Pa(X) Pm(x)e~ 2? dx = 0 ifn ~ m. 


Except for constant factors these are the Hermite polynomials. 
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4. Suppose H is a finite dimensional complex Hilbert space, of dimension 
N. Show that there is a linear transformation U from H onto C” such that 


(Uu, Uv) = (u, v), all u, ve H. 


(Hint: choose an orthonormal basis for H.) 

5. Suppose H and H’ are two complex Hilbert spaces of dimension 
N < oo. Show that there is a linear transformation U from H onto H’ 
such that 


(Uu, Uv) = (u, v), all u, v EH. 


6. In the space /? of two-sided complex sequences, let e, be the sequence 
with entry 1 in the nth place and all other entries 0. Show that (e,)”. is an 
orthonormal basis for /?. 

7. Show that there is a linear transformation U from /? onto /,? such 
that 

(Ux, Uy) = (x, y), all x, ye /? 


§5. Orthogonal expansions 


Suppose H is a Hilbert space of dimension N < oo. We know that H 
has an orthonormal basis {e,, e,,..., ey}. Any element ue H is a linear 
combination 


N 

(1) u = > An€ns 
n=1 

and as in §4 we see that 

(2) a, = (U, e,). 


It follows that if u, ve H then 


(3) (u,v) = > (wend, ¥) 


In particular, 


(4) jal? = 2, \(u, e,)|2. 


The expression (1) for ue H with coefficients given by (2) is called the 
orthogonal expansion of u with respect to the orthonormal basis {e,, ..., ey}. 

We are now in a position to carry (1)-(4) over to an infinite-dimensional 
separable Hilbert space. 


Theorem 5.1. Suppose H is a Hilbert space with an orthonormal basis 
(e,)7. Jf ue H, there is a unique sequence (a,)~ of scalars such that 


(5) u= > a, 
n=1 
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in the sense that 


—>0 as N->o. 


(6) 


N. 
u— > a,en 
n=1 


The coefficients are given by 


(7) a, = (U, en); 

and they satisfy 

(8) > lal? = [ul 

More generally, if 

(9) u = > ane, and v= > ben 
then 

(10) (u,9) = > abt = > Geen 


Conversely, suppose (a,)? is a sequence of scalars with the property 
(11) > lan]? < 00. 
n=1 


Then there is a unique element uEH such that (5) is true. 


Proof. First let us prove uniqueness. Suppose (a,)° is a sequence of 
scalars such that (6) is true. Let 


N 
(12) Uy = > An€n- 
n=1 


Since the sequence (e,)? is orthonormal, 
(uy, €,) = Qn if N > n. 
Using Lemma 2.1 we get 
a, = dim (Uy, €n) = (U, en). 
Thus (a,)? is unique. 
To prove existence, set a, = (u, e,). Define uy by (12). Then 
(uy, €,) = (U, e,), l<n<QN. 


This implies that u — uy is orthogonal to the subspace Hy spanned by 
fe,,..., @y}. Now given e > 0, there is a linear combination v of the e, 
such that 

lu — vl <e. 
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Then there is an Ny such that ve Hy when N = Np. As in the proof of 
Lemma 2.3, the facts that uy ¢ Hy and that u — uy | Ay imply 


Ju — uyl| < ju — vi. 


Thus uy — u. Since the e, are orthonormal, 


uy ||? = (uy, Uy) = > |an|?. 


n=1 


Thus 


|u|? = lim uy? = > Jayl? 


More generally, suppose u and v are given by (9). Let uy be defined by (12), 
and let vy be defined in a similar way. Then by Lemma 2.1, 


N foe) 
(u, v) = lim (uy, vy) = lim > a,b* = > a,bd¥. 
n=1 


r= 


a 


Finally, suppose (a,)? is any sequence of scalars satisfying (11). Define 
uy by (12). All we need do is show that (uy)? is a Cauchy sequence, since we 
can then let u be its limit. But if N > M, 


N 
(13) [uy — Uy,||? = (dy — Uy, Uy — Uy) = > |a,|?. 
n=M+1 


Since (12) is true, the right side of (13) converges to zero as M, N-> 00. [J 


It is convenient to have the corresponding statement for a Hilbert space 
with an orthonormal basis indexed by all integers. The proof is essentially 
unchanged. 


Theorem 5.2. Suppose H is a Hilbert space with an orthonormal basis 
(e,)~ .. Jue H, there is a unique two-sided sequence (a,)~ . of scalars such 
that 


(14) U= > Aen, 
in the sense that 
N 
(15) u — > a,€,\| >~0 as N->oo. 
n=—N 


The coefficients are given by 
(16) a, = (U, en), 


and they satisfy 


ie.0) 


(17) > lanl? = |ul? 


n=— 0 
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More generally, if 


then 
(18) (u,v) = >, anb¥ = > (u, en)(p, ¥). 
Conversely, suppose (a,,)~ ~. is a sequence of scalars with the property 


> la, |? < 00. 
Then there is a unique ué H such that (15) is true. 


The equations (5), (6), or (14), (15) give the orthogonal expansion of u 
with respect to the respective orthonormal bases. The identity (8) or (17) is 
called Bessel’s equality. It implies Bessel’s inequality: 


(19) 2, I(a, e,)|? < [ul? 
or 
(20) 2, , I(a, en)/? < al, 


The identity (10) or (18) is called Parseval’s identity. The coefficients a, 
given by (6) or (14) are often called the Fourier coefficients of u with respect to 
the respective orthonormal basis. 


Exercises 


1. Suppose H and Hy, are two infinite-dimensional separable complex 
Hilbert spaces. Use Theorems 4.2, 4.3, and 5.1 to show that there is a linear 
transformation U from H onto H, such that 


(Uu, Uv) = (u, v), all u, ve H. 
Show that U is invertible and that 
(U~*u,, U~*v1) = (Uy, ¥:), all u,, v, € Hy. 


Such a transformation U is called a unitary transformation, or a unitary 


equivalence. 
2. Let U: 1,2 1,2 be defined by 


U((a,, Qo, A3,.. .)) = (0, a1, Ag, A3,.. .). 
Show that U is a 1-1 linear transformation such that 
(Ux, Uy) = (x,y), all x, ye 1,7. 


Show that U is not onto. 
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$6. Fourier series 


Let L? be the Hilbert space introduced in §1. Thus L? consists of each 
periodic distribution F which is the limit, in the sense of L?, of a sequence 
(u,)y < @, 1e., 


[un — Um|| — 9, 
F,,—>F (#). 


We can identify the space @ of continuous periodic functions with a subspace 
of L? by identifying the function u with the distribution F,,. Then 


\Ful? = ful? = 5- | |uGol? ae. 


In particular, we may consider the two-sided sequence of functions e,, 
€,(x) = exp (inx), n=0, +1, +2,... 
as elements of L?. 


Lemma 6.1. The sequence of functions (e,)” .., considered as elements 
of L?, is an orthonormal basis for L?. 


Proof. Clearly |e,|| = 1. If m #4 n, then 
27 2 
| €,(X)Cm(x)* dx = | exp (inx — imx) dx 
0 0 
= [i(n — m)]~* exp (i(n — m)x)|§” = 0. 


Thus (e,)2. is an orthonormal set. Now suppose Fe L?. Given « > 0, 
there is a function u € @ such that 


IF. — Fil <e. 

There is a linear combination v = > a,e, such that 
jv —ul <e. 

Then 


|F.- Fl < | -Al + |A- Fl 
= |v —ol| + |A- Fl 
<|v—ul +e < 2e. 0 


Since (e,)%.~ is an orthonormal basis, we may apply Theorem 5.2 and 
obtain orthogonal expansions for distributions in L?. 


Theorem 6.2. Suppose FeL?. There is a unique two-sided sequence 
(An)~ «@ < C such that 


(1) F= S Ay, exp (inx) 


n=—-@ 
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in the sense that the functions 


N 
(2) > Ane, > F(L*) as N->o. 


n=— 0 


The coefficients are given by 


(3) a, = F(e-n), 
and they satisfy 
(4) >, lanl? = FI. 


More generally, if 
F= > a, exp (inx) and G= > b,, exp (inx) 
in the sense of (2), then 


(5) (F,G) = > abt = > F(e-n)G*(ey) 


-— © — © 


Conversely, suppose (a,)~ » < C and 
> |a,|? < 00. 


Then there is a unique F € L? such that (1) is true, in the sense of (2). 


Proof. In view of Lemma 6.1 and Theorem 5.2, we only need to verify 
that 
(F, F..) = F(€_n)s G(e_,)* = G*(e,). 


The second identity follows from the first and the definition of G*. To prove 
the first, take (u,,)2 © C, 


Un > F (L?). 
Then 
(F, Fe) = lim (up, Cn) = lim F,,(@ -n) — F(e_,). U 


The a, 1n (3) are called the Fourier coefficients of the distribution F. The 
formal series on the right in (1) is called the Fourier series of F. 
Suppose F = F, where u€é@. Then (2) is equivalent to 


(6) ) ” |u(x) — > a, exp (inx)|? dx + 0, 


where 


(7) a, = ~ [- u(x) exp (—inx) dx. 
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In fact, (6) and (7) remain valid when u is simply assumed to be an integrable 
function on [0, 27]. In this case the a, are called the Fourier coefficients of 
the function u, and the formal series 


> Ay, exp (inx) 


is called the Fourier series of the function u. The fact that (6) and (7) remain 
valid in this case is easily established, as follows. If u: [0, 27] — C is integ- 
rable, then again it defines a distribution F,, by 


F.(v) = Pa { cow) dx. 


Then an extension of the Schwarz inequality gives 


FOL < (xo [  lueol? ax)" Fol = [al Lol. 


0 


By Exercise 2 of §2, F,, € L?. 
When we @, it is tempting to interpret (1) for F, as 


u(x) = Ss An, eXp (inx). 


In general, however, the series on the right may diverge for some values of 
x, and it will certainly not converge uniformly without further restrictions 
on u. It is sufficient to assume that u has a continuous derivative. 


Lemma 6.3. Jf (a,)2. © C has the property 


() > la] < 00, 


then the functions 
N 
uy = > An€n 
n=-N 


converge uniformly to a function u € @, and (a,)® ~ is the sequence of Fourier 
coefficients of u. 


Proof. Since |e,(x)| = |exp (inx)| = 1 for all x eC, it follows from (8) 
that the sequence of functions (uy)? is a uniform Cauchy sequence. Therefore 
it converges uniformly to a function ue @. For N > m we have 


(uy, Cm) = Ay. 
Thus 
an = lim (uy, Cm) = (u, Cm) 


is the mth Fourier coefficient of u. [ 
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Theorem 6.4. If ue@ and u is continuously differentiable, then the 
partial sums of the Fourier series of u converge uniformly to u. Thus 


u(x) = > (u, €,) exp (inx), allx ER. 


Moreover, if ue F then the partial sums converge to u in the sense of F. 
Proof. Let v = Du. There is a relation between the Fourier coefficients 
of v and those of u. In fact integration by parts gives 


22 


(9) b, = (v,&) = os Du(x) exp (—inx) dx 


in [?" 
= 57 ) u(x) exp (—inx) dx 


= in(u, e,) = ina,,. 
We can apply the Schwartz inequality for sequences to show > |a,| < ©. 
In fact 


> |an| = |@o| + > lan] = |ao] + > n~*|b,| 
n#0 n#0 
1/2 1/2 
< |ap| +(> n-*) (> 2.1) 


n#0 
co 1/2 
= |ao| + (2 > n-4) || Dul] < 0. 
n=1 


By Lemma 6.3, the partial sums of the Fourier series of u converge uniformly 
to u. 
Now suppose ue & and let 


N 
un = > Qnen- 


n=—N 


Then (9) shows that 


N N 
Duy = > ina,en = > ben 
n=-N n=-N 
is the Nth partial sum of the Fourier series of Du. Similarly, D*uy is the Nth 
partial sum of D*u. Each D*u is continuously differentiable, so each D*uy > 
D*u uniformly. { 


Fourier series expansions are very commonly written in terms of sine 
and cosine functions, rather than the exponential function. This is partic- 
ularly natural when the function u or distribution F is real. Suppose F € L?. 
Let 


(10) a, = F(e_n)s b, = 2F(cos nx), C, = 2F(sin nx). 


Since 
e_,(x) = cosnx — isinnx 
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we have 
a, = 4(b, — ic,). 
Also 
b_, = b,, Cin = —Cy 
Thus for n > 0, 


An€y(X) + A-ne-n(X) = a,(cos nx + isin nx) + a_,(cos nx — isin nx) 
= b, cosnx + c, sin nx. 


Then 
N N 
> a, exp (inx) = 4bo + > b, COS nx + c, sin nx. 
n=-N n=1 
The formal series 


(11) dbo + > b, cos nx + c, sin nx 

n=1 
is also called the Fourier series of F, and the coefficients b,, c, given by (10) 
are also called the Fourier coefficients of F. If F is real, then b,, c, are real, 
and (11) is a series of real-valued functions of x. Theorems 6.2 and 6.4 may 
be restated using the series (11). 


Exercises 


1. Find the Fourier coefficients of the following integrable functions on 
[0, 27]: 


(a) u(x) = 0, xe [0, z], u(x) = 1, x E(x, 2a]. 

(b) u(x) = 0, xe [0, z], u(x) = x — a, x E(x, 27]. 
(c) u(x) = |x — aI. 

(d) u(x) = (x — 7). 

(e) u(x) = x 

(f) u(x) = |cos x]. 


2. Suppose u € @ and suppose b,, c, are as in (10). Show that if u is even 
then c, = 0, all n. Show that if u is odd, then b, = 0, all n. (It is convenient 
to integrate over [—7, 7] instead of [0, 27].) Show that if wu is real then 


1 2 ; eae 3 
P+ 5 D ( (b,, + Cn’) = 57 u(x)? dx. 


a 


3. Suppose ue @, 


>) (u, @-n)€n- 


n=-—-N 
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Show that 
1 20 
un(x) = 5- | Dale — yuty) ay, 
0 


where 
Dy(x) = sin (N + 4)x/sin 4x. 
The function Dy is called the Dirichlet kernel. Thus 
Uy = Dy * U. 


4. Extend the result of Exercise 3 to the partial sums of the Fourier 
series of a distribution F € L?. 
5. For Fe F#, define 


a, = F(e_,), n=0, +1, +2,.... 


Show that F € L? if and only if 


|an|? < 00, 


ss 


6. Suppose Fe #’ and DF € L?. Show that F = F, for some continuous 
function u. (Hint: find the Fourier coefficients of w.) 


Chapter 5 


Applications of Fourier Series 


§1. Fourier series of smooth periodic functions 
and of periodic distributions 


If uw is a smooth periodic function with Fourier coefficients (a,)°.., then 
we know that the sequence (a,)2.. uniquely determines the function uw; in 
fact, the partial sums 


(1 s(x) = > ay exp (inx) 


of the Fourier series converge to u in the sense of A Therefore it makes sense 
to ask: what are necessary and sufficient conditions on a two sided sequence 
(a,)~« < C that it be the sequence of Fourier coefficients of a function 
ue #? The question is not hard to answer. 

A sequence (a,)2.. © C is said to be of rapid decrease if for every r > 0 
there is a constant c = c(r) such that 


(2) la,| < cln|~-", alln 40. 


Theorem 1.1. A sequence (a,)2. © C is the sequence of Fourier coeffi- 
cients of a function u € F if and only if it is of rapid decrease. 


Proof. Suppose first that ue FY has (a,)2., as its sequence of Fourier 
coefficients. Given r > 0, take an integer k = r. In proving Theorem 6.4 of 
Chapter 4 we noted that (ina,)® .. is the sequence of Fourier coefficients of Du. 
It follows that 


((in)*a,)* 
is the sequence of Fourier coefficients of D*u. But then 
jn|*|a,| < |D*ul, 


which gives (2) with c = | D*ul. 
Conversely, suppose (a,)”. © C is a sequence which is of rapid decrease. 
Define functions uy by (1). From (2) with r = 2 we deduce 


00 
> lan| < ©. 
— © 


By Lemma 6.3 of Chapter 4, the functions (1) converge uniformly to ue %, 
and (a,)~.. are the Fourier coefficients of u. Also 


N 


D'uy = > (in)*anens 


-N 
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and (2) with r = k + 2 implies 


fe @] 


>, lal*|aa| < 00. 


Thus each derivative D*u, also converges uniformly as N-> oo. Therefore 
uEeZF J 


If F is a periodic distribution which is in L?, then we have defined its 
Fourier coefficients by 


(3) a, = F(e_n), €_n(x) = exp (—inx). 


Since e_, € # the expression in (3) makes sense for any periodic distribution 
F, whether or not it is in L?. Thus given any Fe #, we define its Fourier 
coefficients to be the sequence (a,)2.. defined by (3). We know that if all the 
a, are zero, then F = 0 (see Chapter 3, Theorem 4.3). Therefore, Fe 7’ 
is uniquely determined by its Fourier coefficients, and we may ask: what are 
necessary and sufficient conditions on a sequence (a,)*., < C that it be the 
sequence of Fourier coefficients of a periodic distribution F? Again, the 


answer is not difficult. 
A sequence (a,)2.~. © C€ is said to be of slow growth if there are some 


positive constants c and r such that 
(4) lan| <cln|", = alln 4 0. 


Theorem 1.2. A sequence (a,)~%.< C is the sequence of Fourier co- 
efficients of a distribution F € F’ if and only if it is of slow growth. 


Proof. Suppose first that Fe # has (a,)2. as its sequence of Fourier 
coefficients. Recall that for some integer k, F is of order k. Thus foruceF 


|F(u)| < e(ju| + |Du| +---+ |D*u)). 
With u = e_, this means 
la,| = |F(u)| < cll + [nl +--+ + [n]*) < 2eln|*. 


Thus (a,)2. is of slow growth. 
Conversely, suppose (a,)2.. © € is a sequence which is of slow growth. 
Then there is an integer kK > 0 such that 
(5) lan] < cln|*¥-2, an £0. 
Let db) = 0 and 
b, = (in)~“a,, n#0. 
Let 


N 
UN => > brn: 
-N 


From (5) we get 


S |bn| < 00. 
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Therefore vy converges uniformly to ve @. Let 
F = D*F, + F;, f= Ao€o. 


This is a distribution; we claim that its Fourier coefficients are the (a,)2 ». 
In fact, for n 4 0, 


F(e_,) = D* Fe -n) —- F((— D)*e_,) 
= (in)*F(e_,) = (in)*b, = ay. 
Also 
F(€9) = D*F (eo) + Fy(€o) = 0 + ap. O 


In the course of the preceding proof we gave a second proof of the 
characterization theorem for periodic distributions, Theorem 6.1 of Chapter 3. 
In fact, the whole theory of 7 and # in Chapter 3 can be derived from the 
point of view of Fourier series. We shall do much of such a derivation in this 
section and the next. An important feature of such a program is to express 
the action of Fe #’ on ue F in terms of the respective sequences of Fourier 
coefficients. 


Theorem 1.3. Suppose Fe F' has (a,)~ ~ as its sequence of coefficients, 
and suppose u& F has (b,)® » as its sequence of Fourier coefficients. Then 


(6) F(u) = > arb-n = > a-ndn. 
Proof. Let 


uy(x) = S b, exp (inx). 


We know 
Uy —>U (A). 
Therefore 
F(u) = lim F(uy). 
But 


N N N 
F(uy) = 2, b,F (en) = 2, a_y»b, = » a,b _». U 


Implicit in the proof of Theorem 1.3 is the proof that the series in (6) 
converges. A more direct proof uses the criteria in Theorem 1.1 and 1.2. 
In fact, 


lan] <cln|’, nO, 
[bn| < c'|ln[-"-2,, an #0 


and convergence follows. 
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Corollary 1.4. Jf Fe FY has Fourier coefficients (a,)~, and if Fy is 
the distribution defined by the function 


N 
> a, exp (inx), 
—N 
then 
Fy > F(#). 
Proof. With u, uy as in Theorem 1.3, 
Fy(u) = F(uy) > Fu). 0 
Exercises 


1. Compute the Fourier coefficients of 6 and D*6. 
2. If Fe F has Fourier coefficients (a,)°.., compute the Fourier coeffi- 
cients of 
T,F, F*, F”, 


3. Give necessary and sufficient conditions on the Fourier coefficient 
(a,)~ . of a distribution F that F be real; or even; or odd. 
4. Suppose (9,,)° < @ is an approximate identity. Let 


(Am,n)n= - © 
be the sequence of Fourier coefficients of ,,. Show that 


lam.n| < 1, m>l, n=0, +1, +2,...; 
lim ann = 1, all n. 


m— 0 


§2, Fourier series, convolutions, and approximation 


Recall that if F, GEA and we Z the convolutions F « u and F « G are 
defined by 
F* u(x) = F(T,u), 
(F x G)(u) = F(G™ * uv). 
We want to compute the Fourier coefficients of the convolutions in terms of 


the Fourier coefficients of F, G, and u. 


Theorem 2.1. Suppose Fe FY’ has Fourier coefficients (a,)~ .; suppose 
Ge has Fourier coefficients (b,)~ .; and suppose ué€ F has Fourier coeffi- 
cients (C,)~.. Then F*u has Fourier coefficients (a,c,)2.. and F* G has 
Fourier coefficients (a,b,)~ «- 


Proof. Note that 
(TE, )(Y) = e_ly — x) = €,(x)e_n(y). 
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Therefore 
(F * €n)(x) = €,(x)F (e_n) = Ane n(x), 


and 


N N 
F x (> cx] = > AnCnen- 
-N 


—N 


Taking the limit as N->0oo, we find that F*wu has Fourier coefficients 


(2nCn)~ wo. 
Now 
(G~ * €,,)(x) = G(T. En) = e,(x)G” (é,) 
= €n(x)G(En) = b_,€,(X). 
Therefore 


(F* G)(e_,) = F(G~ *e_,) = by F(e—n) = Andy. U 


Using Theorem 2.1 and Theorems 1.1 and 1.2, we may easily give a 
second proof that F « ue ZF Similarly, if ue A and G = F,, then 


FeG=F,, where v= Feu; 


in fact F * G and F * u have the same Fourier coefficients. 
The approximation theorems of Chapter 3 may be proved using Theorem 
2.1 and the following two general approximation theorems. 


Theorem 2.2. Suppose (un)? < Y Suppose the Fourier coefficients of 
Um are 


(n,n) n: =—o- 
Suppose that for each r > O there is a constant c = c(r) such that 
lam.n| <cln|~",  allm,alln 4 0. 
Suppose, finally, that for each n, 


ann —_> an as m —_> CO. 


Then (a,)~ « is the sequence of Fourier coefficients of a function ue Z More- 
over, 


Un —>Uu (A). 
Proof. The conditions imply that also 
lan] <cln|-", nO. 


Thus (a,)%.. is the sequence of Fourier coefficients of a function ue F 
Given e > 0, choose N so large that 


c(2) > n-* <e, 
n=N 
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Choose M so large that m => M implies 


N 
> lQn.n — @n| < €. 
-N 

It follows that if m > M then 


lQn,n — Qn| < Se. 


sMe 


Since 


u= > Anens Un = > Am,n€ns 
this implies 
lun, —~ul < Se ifm< M. 
Thus u,, —> vu uniformly. A similar argument shows each D*u,,—> D*u. 


Theorem 2.3. Suppose (F,,) < #. Suppose the Fourier coefficients of 
F, are 
(Qm,n)n= - 0 


Suppose that for some r > 0 there is a constant c such that 
lam.n| < cln|’, all m, alln # 0. 
Suppose, finally, that for each n, 


Ann —~ 4, and m-—>o. 


Then (a,)~ © is the sequence of Fourier coefficients of a distribution Fe F’. 
Moreover, 
F,—2>F (FY). 


Proof. The conditions imply that also 
lan] < cln|", alln 40. 


Thus (a,)2. is the sequence of Fourier coefficients of a distribution Fe #. 
Take an integer k > r + 2, and let 


Bn,o = Bo = 0, 
Bn,n = (in) "Gn ns n#0 
b, = (in)~ “az, n #0. 


As in the proof of Theorem 1.2, (On.,)r=-.0 iS the sequence of Fourier 
coefficients of a function v,, € @, with 


Fn = D‘F,,, + Fy tn = Gn,0o- 
Similarly, (5,)2.. 18 the sequence of Fourier coefficients of ve @ with 


F= D*F, + F;, f= Ap€o- 
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The hypotheses of the theorem imply 


lbm.n| < cln|-2, n #0, 
Binn > bn AS M->OO. 


As in the proof of Theorem 2.1, these conditions imply that 
Um —>v uniformly as m-—>oo. 


Also, 


Qm,0 —_> Qo. 


It follows that F,,—> F (#7). Oo 


Exercises 


1. Suppose (9,,)° < @ is an approximate identity. Use the theorems of 
this section and Exercise 4 of §1 to prove that: 


Pm *U—>U (P) ifueF; 
Fxo,>F (Ff) if FeF. 


2. State and prove a theorem for L? which is analogous to Theorem 1.2 
for F and Theorem 1.3 for @. 

3. Use the result of Exercise 2 to show that if (y,,)? © @ is an approxi- 
mate identity and Fe L?, then 


F * om — F (L?). 


4. Prove the converse of Theorem 2.2: if u,,—>u (P) then the Fourier 
coefficients satisfy the hypotheses of Theorem 2.2. 


§3. The heat equation: distribution solutions 


Many physical processes are approximately described by a function u, 
depending on time and on position in space, which satisfies a type of partial 
differential equation called a heat equation or diffusion equation. The simplest 
case is the following. Find u(x, t), a continuous function defined for x € [0, z] 
and for ¢ > 0, satisfying the equation 


(1) = u(x, t) = (A) ur t), x € (0, 7), t> 0, 


the initial condition 


(2) u(x, 0) = g(x), x € [0, x], 
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and one of the following two sets of boundary conditions: 


(3) u(0,t)=u(7,t)=0, ¢20; 
3)’ o 0, t a t)=0 t>0O 
(3) 5, HOO, )= au, — Vy» > V. 


The function u describes the temperature distribution in a thin homo- 
geneous metal rod of length 7. The number x represents the distance of the 
point P on the rod from one end of the rod, the number f¢ represents the time, 
and the number u(x, ¢) the temperature at the point P at time t. Equation (1) 
expresses the assumption that the rod is in an insulating medium, with no 
heat gained or lost except possibly at the ends. The constant « > 0 is pro- 
portional to the thermal conductivity of the metal, and we may assume units 
are chosen so that x = 1. Equation (2) expresses the assumption that the 
temperature distribution is known at time ¢ = 0. Equation (3) expresses the 
assumption that the ends of the rod are kept at the constant temperature 0, 
while the alternative equation (3)’ applies if the ends are assumed insulated. 
Later we shall sketch the derivation of Equation (1) and indicate some other 
physical processes it describes. 

Let us convert the two problems (1), (2), (3) and (1), (2), (3)’ into a single 
problem for a function periodic in x. Note that if (2) and (3) are both to hold, 
we should have 


(4) 8(0) = g(a) = 0. 


Let g(—x) = —g(x), x €(—7, 0). Then g has a unique extension to all of 
IX which is odd and periodic (period 27). Because of (4) the resulting function 
is still continuous. Suppose u were a function defined for all xe Rand? > 0, 
periodic in x, and satisfying (1) for all x Ee R, t > 0. Then u(x, 0) = g(x) is 
odd. If g is smooth, then also 


(=) “Wx, 0) = D2g(x) 


is odd, and we might expect that u is odd as a function of x for each t > 0. 
If this is so, then necessarily (3) is true. 

Similarly, if (2) and (3)’ are both to hold, we would expect that if g is 
smooth then 


Dg(0) = Dg(n) = 0. 


In this case, let g(—x) = g(x), x €(—7, 0). Then g has a unique extension 
to all of R which is even and periodic. If g is of class C+ on [0, 27], the ex- 
tension is of class C* on R. Again, if u were a solution of (1), (2) which is 
periodic in x for all t, we might expect wu to be even for all ¢. Then (3)’ is 
necessarily true. 

The above considerations suggest that we replace the two problems 
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above by the single problem: find u defined for x Ee R, t > 0, such that wu is 
periodic in x, 


(5) 2 x,t) = (2)'ux, 1) ER, t>0 
ato a Ox UX, ’ x ) ) 


(6) u(x, 0) = g(x), xeER, 


where g € @ is given. 

Jt is convenient and useful to ask for solutions of an analogous problem 
for periodic distributions. We formulate this more general problem as follows. 
Suppose that to each ¢ in some interval (a, b) € R we have assigned a distri- 
bution F;e 7’. If se(a, b), GEF, and 


(¢-s) "(Rh -F)>G (F) 
as t > s, it is natural to consider G as the derivative of F, with respect to ¢ 


at t = s. We do so, and write 


d 
G=ah 


t=s 


Our formulation of the problem for distribution is as follows: given Ge #, 
find distributions F, ¢ 7’ for each ¢ > 0, such that 


& — 72 
(7) qi|..= DF» alls > 0, 


(8) F.>G (#) as t>0. 


Theorem 3.1. For each GE#' there is a unique family (F.)i.5 < F 
such that (7) and (8) hold. For each t > 0, F, is a function ux) = u(x, t) 
which is infinitely differentiable in both variables and satisfies (5). 


Proof. Let us prove uniqueness first. Suppose (F;);,, is a solution of 
(7), (8). Each F; has Fourier coefficients (a,(t))° «, 


a,(t) = F(e_,), t>0,n=0, +1,.... 
Then 


(t — s)[a,(t) — a,(s)] > D?F(e-,) = F,(D’e_n) = —n?F(e_») 
as t > s. In other words, 
(9) Da,(t) = —n?*a,(t), t> 0. 
As t 0 we have 
(10) a,(t) = F(e_,) > G(e-n) = dn. 
The unique function a(t), t > 0 which satisfies (9) and (10) is 


(11) a,(t) = b, exp (—n?"?). 
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This shows that a,(t) are uniquely determined, so the distributions F, are 
uniquely determined. 

To show existence, we want to show that the a,(t) defined by (11) are the 
Fourier coefficients of a smooth function for ¢ > 0. Recall that if y > 0 
then 


ev = > (n!)~1y" > (m!)~ty™ 


SO 
e~Y% < ml yo™, 
Therefore 
|a,(t)| < |b,|m!n-?"-t-™, = m=0,1,2,.... 
But 


lbn| < cln|*, = alla, 
for some c and k. Therefore 
|a,(t)| < cm! |n|*¥-?2"-27-”, m=0,1,2,.... 


It follows that for each t > 0, (a,(t))2. is the sequence of Fourier coeffi- 
cients of a function u,¢ ZA Then y; is the uniform limit of the functions 


ule t) = > ay(t) exp (inx) 


—N 
N 


= > b, exp (inx — n*t). 
—N 


Then 
OV (2 - (x,t) = S b,(—n?)'in)' exp (inx — n?t) 
Ot Ox N\*> aa n 
N 
= > ay,.,(0) exp (inx). 
—N 
As above, 


lan.ar(t)| < cm! [n[kt2tr—2".¢-™ =m = 0,1,2,.... 


It follows that each partial derivative of uy converges as N — oo, uniformly 
forxeRandt = 6 > 0. Thus 


u(x, t) = u(x) 


is smooth for x € R, t > 0. For each N, uy satisfies (5). Therefore u satisfies 
(5). 

Let F; be the distribution determined by ™, ¢ > 0. Thus the Fourier 
coefficients of F,; are the same as those of 1. 

It follows from (11) that 


la,(t)| < |b,|, a(t) > 5b, as tO. 
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By Theorem 2.3, therefore, (8) is satisfied. Finally, each partial derivative 
of u is bounded on the region x ER, t > 6 > 0. It follows from this and the 
mean value theorem that 


4) 
(t ~~ s)~*(u, _ Us) —> a Us 


uniformly as t-> s > 0. Therefore (7) also holds. gj 


Exercises 


1. In Theorem 3.1, suppose G is even or odd. Show that each F, is re- 
spectively even or odd. Show that if G is real, then each F,; is real. 

2. Discuss the behavior of F, as t —> 00. 

3. Formulate the correct conditions on the function u if it represents the 
temperature in a rod with one end insulated and the other held at constant 
temperature. 

4. In Theorem 3.1, suppose G = F,, the distribution defined by a 
function w € & Show that the functions u, > w (P) as t> 0. 

5. Let 


ce.) 


g(x) = > exp (—n?t + inx), t>0, xeR. 


Show that g,¢€ Z In Theorem 3.1, show that 
u= G* g,. 
6. With g, as in Exercise 5, show that 
St * §s = St+s: 
7. The backwards heat equation is the equation (1) considered for ¢ < 0, 
with initial (or “final’’) condition (2). Is it reasonable to expect that solutions 


for this problem will exist? Specifically, given G ¢ F’, will there always be a 
family of distributions (F;);<9 © # such that 


£F, = D*F., all s < 0, 
t=s 


F,>G (#) as t->0? 
8. The Schrédinger equation (simplest form) is the equation 
ou _ 
at ax 
Consider the corresponding problem for periodic distributions: given 
Ge’, find a family (F;);>9 of periodic distributions such that 


d _ ane 
fr = iD, all s > 0, 


F,—>G (#’) as t—-0. 


Discuss the existence and uniqueness of solutions to this problem. 
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$4. The heat equation: classical solutions; derivation 


Let us return to the problem given at the beginning of the last section: 
find u(x, t), continuous for x € [0, 27] and t > 0, and satisfying 


0 0 \2 
(1) Huet) =(Z) uxt), xeQm), 105 
(2) u(x, 0) = g(x), where geC/[0, 7) is given; 
(3) u(0, t) = u(z,t) = 0, +20 
or 
(3) © 40,1) =~ um, t)=0, 1t>0 
ax UN 1) = By AF) = Us 


Such a function w is called a classical solution of the problem (1), (2), (3) 
or (1), (2), (3)’, in contrast to the distribution solution for the periodic 
problem given by Theorem 3.1. In this section we complete the discussion 
by showing that a classical solution exists and is unique, and that it is given 
by the distribution solution. We consider the problem (1), (2), (3), and leave 
the problem (1), (2), (3)’ as an exercise. 

Given geC[0,7z] with g(0) = g(7) = 0 (so that (3) is reasonable), 
extend g to be an odd periodic function in &, and let G = F,. By Theorem 3.1, 
there is a function u(x, t) = u,(x) which is smooth in x, t for xe R, t > 0, 
satisfies (1) for all such x, t, and which converges to G in the sense of # as 
t—> 0. By Exercise 1 of §3, wis odd as a function of x for each ¢t > 0. Since u 
is also periodic as a function of x, this implies that (2) holds when ¢ > 0. 
If we knew that u, —> g uniformly as t — 0, it would follow that the restriction 
of uto 0 < x < wis a classical solution of (1), (2), (3). Note that this is true 
when (the extension of) g is smooth: see Exercise 4 of §3. Everything else we 
need to know follows from the maximum principle stated in the following 
theorem. 


Theorem 4.1. Suppose u is a real-valued classical solution of (1), (2). 
Then for each T > 0, the maximum value of u(x, t) in the rectangle 


O<x<n, OK<t<T 
is attained on one of the three edges t = 0, x = 0, orx = 7. 
Proof. Given « > 0, let 
v(x, t) = u(x, t) — et. 


It is easy to see that the maximum value of v is attained on one of the edges 
in question. Otherwise, it would be attained at (xo, to), where 


Xo E (0, 1), to > 0. 


For v to be maximal here, we must have 


0 
at V(Xo; to) > 0. 
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But then 


Oo \2 O \2 O O 
ax V(Xo, to) = ax U(X, to) = aL U(Xo, fo) = at V(Xo, to) + € 


>e>Q0, 


SO Xp cannot be a maximum for v(x, to) on [0, 27]. 
Now uw and v differ by at most «T on the rectangle. Therefore for any 
(x, t) in the rectangle, 


u(x,t) < M + 2eT, 


where M is the maximum of u for t = 0, x = 0, or x = =. Since this is true 
for every « > 0, the conclusion follows. { 


Theorem 4.2. For each continuous g with g(0) = g(7) = 0, there is a 
unique classical solution of problem (1), (2), (3). 


Proof. Note that uw is a classical solution with initial values given by g 
if and only if the real and imaginary parts of u are classical solutions with 
initial values given by the real and imaginary parts of g, respectively. There- 
fore we may assume that g and uw are real-valued. Applying Theorem 4.1 
we see that 


u(x, t) < |g|, all x € [0, z], t> 0. 


Applying Theorem 4.1 to —u, which is a solution with initial values given 
by —g, we get 
—u(x,t)<|g|, all x,t. 
Thus 
|u(x, t)| < |gl, all x, ¢. 
This proves uniqueness. 
To prove existence, let g be extended so as to be odd and periodic. Let 
(on)¢ |< FY be an approximate identity. Let 


Sm = Pn* SEF 

We can choose g,, to be even, so that g,, is odd. Let u,, be the distribution 
solution given by Theorem 3.1 for g,, as initial value, and let u be the distri- 
bution solution with g as initial value. Then we know u,,(x, t) > g,(x) 
uniformly with respect to x at t-> 0, so we may consider u,, as continuous 
for t > 0, xe R. Moreover, since g,,—> g uniformly as m-—> oo, it follows 
that u,,(x, t) > u(x, t), at least in the sense of #’, for each ¢t > 0. On the 
other hand, since |g,, — g| — 0 we find that for x € [0, 7] 


|um(x, t) — up(x, t)| +0 


as m,p—>oo, uniformly in x and ¢, t > 0. Thus we must have u,,—>u 
uniformly. It follows that u has a continuous extension to t = 0, and there- 
fore that u (restricted to x € [0, z]) is a classical solution. 


A second proof of the uniform convergence of u to g as t + 0 is sketched 
in the exercises below. 
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The heat equation, (1), may be derived as follows. Again we consider a 
homogeneous thin metal rod in an insulating medium. Imagine the rod 
divided into sections of length «, and suppose x is the coordinate of the 
midpoint of one section. We consider only this and the two adjacent sections, 
and approximate the temperature distribution at time ¢ by assuming u to be 
constant in each section. The rate of flow of heat from the section centered 
at x + e to that centered at x is proportional to the temperature difference 
u(x + «,t) — u(x,t), and inversely proportional to the distance «. The 
temperature in each section is the amount of heat divided by the volume, and 
the volume is proportional to «. Considering also the heat flow from the 
section centered at x — « to that centered at x, we get an approximate ex- 
pression for the rate of change of temperature at (x, ¢): 


oS u(x, t) © we *[u(x + 2, t) — u(x) + u(x — «, t) — u(x)] 
or 
Fux, t) © xe~*[u(x + 2, t) — ux — e, t) — 2u(x)]. 


Consider the expression on the right as a function of e« and let e—>0. Two 
applications of L’H6pital’s rule give 


(4) | < ule, t)= eq) me t). 


Note that essentially the same reasoning applies to the following general 
situation. A (relatively) narrow cylinder contains a large number of individual 
objects which move rather randomly about. The random motion of each 
object is assumed symmetric in direction (left or right is equally likely) and 
essentially independent of position in the cylinder, past motion, or the 
presence of the other objects. As examples one can picture diffusion of 
molecules or dye in a tube of water kicked about by thermal motion of the 
water molecules, or the late stages of a large cocktail party in a very long 
narrow room. If u(x, t) represents the density of the objects near the point x 
at time ¢, then equation (4) arises again. Boundary conditions like (3)’ 
correspond to the ends of the cylinder being closed, while those like (3) 
correspond to having one way doors at the ends, to allow egress but not 
ingress. 


Exercises 


1. Suppose G = F,, we ZF and suppose (4); is the family of functions 
in Theorem 3.1. Show that if w > 0, then each yu, is = 0. 
2. As in Exercise 5 of §3, let 


g(x) = > exp (—n?t + inx), t> 0. 
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Show that 


2n 


1 
a g(x) dx = 1. 


Show that if we 7 and w = 0, then g; * w => 0. Show, by using any approxi- 
mate identity in Z that g, > 0. 
3. Show that (g;);>9 is an approximate identity as ¢ — 0, 1.e., (in addition 
to the conclusions of Exercise 2) for each 0 < 6 < 7, 
27-6 


lim | g(x) dx = 0. 
(Hint: choose w € F such that w = 0, w(0) = 0, and w(x) = | ford <x < 
27 — 6. Then consider g, « w(0).) 
4. Use Exercise 3 to show that if we @, G = F,, and (u;);59 is the family 
of functions given in Theorem 3.1, then u,—> w uniformly as t + 0. 
5. In a situation in which heat is being supplied to or drawn from a rod 
with ends at a fixed temperature, one is led to the problem 


Hue, t) = (Zz) ue, +fG,1),  xeO,n), 1 >0, 


u(x, 0) = g(x), u(O, t) = u(m, t) = 0. 


Formulate and solve the corresponding problem for periodic distributions, 
and discuss existence and uniqueness of classical solutions. 


§5. The wave equation 


Another type of equation which is satisfied by the functions describing 
many physical processes is the wave equation. The simplest example occurs 
in connection with a vibrating string. Consider a taut string of length 7 
with endpoints fixed at the same height, and let u(x, t) denote the vertical 
displacement of the string at the point with coordinate x, at time f¢. If there 
are no external forces, the function u is (approximately) a solution of the 
equation 


(1) (=) u(x,t) =c (z .) u(x, t), x € (0, 7), t> 0. 


Here c is a constant depending on the tension and properties of the string. 
The condition that the endpoints be fixed is 


(2) u(0,t) = ulr,t)=0, 120. 


To complete the determination of u it is enough to know the position and 
velocity of each point of the string at time ¢ = 0: 


(3) u(x, 0) = 2(%), = ux, 0)= A(x),  xe[0,7]. 


We shall discuss the derivation of (1) later in this section. 
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As in the case of the heat equation we begin by formulating a corre- 
sponding problem for periodic distributions and solving it. The conditions 
(2) suggest that we extend g, h to be odd and periodic and look for a solution 
periodic in x. The procedure is essentially the same as in §3. 


Theorem 5.1. For each G, HE there is a unique family (F,):50 © F 
with the following properties: 


£ F, exists for each s > 0, 
(4) (5) F = D*F, alls > 0, 
(5) F.—> G (F’) as t—+0, 
(6) £ Fi > H (#) as s-—>0. 


Proof. Let (b,)2. be the sequence of Fourier coefficients of G, and let 
(c,)~ » be the sequence of Fourier coefficients of H. If (F;):> 1s such a family 
of distributions, let the Fourier coefficients of F; be 


(a,(t))= w- 


As in the proof of Theorem 3.1, conditions (4), (5), (6) imply that a, is 
twice continuously differentiable for t > 0, a, and Da, have limits at ¢t = 0, 


and 
D?a,(t) — —n*a,(t), 
a,(0) = b,, Da,(0) = Cn: 


The unique function satisfying these conditions (see §6 of Chapter 2) is 
(7) a,(t) = b, cos nt + n~*c, sin nt, n #0, 
(8) Ap(t) = bo + Col. 


Thus we have proved uniqueness. On the other hand, the functions (7), (8) 
satisfy 


(9) Jan(t)| < [bn] + |n|~*len|, 2 #0, 
(10) a,(t)—>b, as t—+0, all n; 
(11) |Da,(t)| < |nbp| + len], all, 
(12) Da,(t)—>c, as t—>O0, all n. 


It follows from (9) and Theorem 1.2 that (a,(t))2.. is the sequence of Fourier 
coefficients of a distribution F;. It follows from (10) and Theorem 2.3 that 
(5) is true. It follows from (11), the mean value theorem, and Theorem 1.2 
that 
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exists for s > 0 and has Fourier coefficients (Da,(s))2 .. Then (12) gives (6). 
Finally, 


| D?a,(t)| < |n?b,| + |nc,|, all n. 


d 2 
(a) * . 


exists, s > 0. The choice of a,(t) implies that (4) holds. { 


It follows that 


Let us look more closely at the distribution F, in the case when G and H 
are the distributions defined by functions g and A in F First, suppose h = 0. 
The Fourier series for g converges: 


g(x) = S Q, exp (inx). 


Inequality (9) implies that the Fourier series for F, also converges; then F, 
is the distribution defined by the function u,¢ A where 


u(x, t) = u(x) = > A, COS nt-exp (inx) 


=5 > a,[exp (int) + exp (—int)] exp (inx) 


= ; > a, exp (in(x + t)) + ; > Ay, exp (in(x — t)), 
or 
(13) u(x, t) = 4g(x + t) + 4g(x — f). 
It is easily checked that u is a solution of 


Cu *u 


Ou 
or? —_ Ox2’ u(x, 0) = g(x), OL (x, 0) = 0. 


Next, consider the case g = 0. Again (a,(t))2 ~ is the sequence of Fourier 
coefficients of a function u, € P, 


u(x, t) = u(x) = > n-*c,, sin nt-exp (inx) + Cof. 
n#0 
To get rid of the n~* factor, we differentiate: 
C So, . 
Ox u(x, t) = 2, ic, sin nt exp (inx) 


co 


= ; > Calexp (int) — exp (—int)] exp (inx) 


— ; > C, exp (in(x + t)) — 5 > C, exp (in(x — ft)) 


= $h(x + t) — Zh(x — ft). 
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Integrating with respect to x gives 
1 x+E 
(13) uc = 5] ho) dy + at) 
x—t 


where a(t) is a constant to be determined so that u is periodic as a function 
of x. But the periodicity of h implies that a(t) should be taken to be zero. 
Then u so defined in a solution of 


O2u = 0*u 


Ou 
ve: = aye u(x, 0) = 0, Or (x, 0) = h(x). 


Equation (1) for the vibrating string may be derived as follows. Approxi- 
mate the curve u, representing the displacement at time ¢ by a polygonal line 
joining the points ...(x — «, u(x — e, t)), (x, u(x, t)), (x + 2, u(x + 2, t)),.... 
The force on the string at the point x is due to the tension of the string. In 
this approximation the force due to tension is directed along the line seg- 
ments from (x, u(x, t)) to (x + e, u(x + e, t)). The net vertical component 
of force is then proportional to 


(5) ‘uc, t) & ce [u(x + 6) + u(x — 2) — 2u(n)] 


where c is constant. We take the limit as e—0. Two applications of 
L’H6pital’s rule to the right side considered as a function of « give (1). 
The constant c? can be seen to be equal to 


qtr 2, 


where r is the diameter of the string or wire, 7 is the tension, and « is pro- 
portional to the density of the material. Let us suppose also that the length 
of the string is z/ instead of 7. The solution of the problem corresponding 
to (1), (2), (3) when g is real and / = 0 can be shown to be of the form 


(14) u(x, t) = S b,, cos (ent/1) sin (nx/1). 
A single term 
b,, cos (cnt/1) sin (nx/L) 


represents a “standing wave’’, a sine curve with n maxima and minima in 
the interval [0, z/] and with height varying with time according to the term 
cos (cnt/]). Thus the maximum height is 5,, and the original wave is repeated 
after a time interval of length 


2nl/cn. 
Thus the frequency for this term is 
cn/2al = n(«r)*!?/2lr, 
an integral multiple of the lowest frequency 


(15) (xr)? /2aIr. 
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In hearing the response of a plucked string the ear performs a Fourier 
analysis on the air vibrations corresponding to u(x, t). Only finitely many 
terms of (14) represent frequencies low enough to be heard, so the series 
(14) is heard as though it were a finite sum 


(16) S b,, cos (cnt/1) sin (nx/I). 


In general, if the basic frequency (15) is not too low (or high) it is heard as 
the pitch of the string, and the coefficients b,, b.,...,b, determine the 
purity: a pure tone corresponds to all but one b, being zero. Formula (15) 
shows that the pitch varies inversely with length and radius, and directly 
with the square root of the tension. 


Exercises 


1. In Theorem 5.1, suppose DG and H are in L?. Show that 


d 
DF, and di F, 


are in L? for each ¢, and 


d 2 
DFP + [5 Fi = LG? + [a 


2. In the problem (1), (2), (3) with c = 1 suppose g, h, and u are smooth 
functions. Show, by computing the derivative with respect to t, that 


a 2 
| dx + | 
0 


tt 
0 
is constant. (This expresses conservation of energy: the first term represents 
potential energy, from the tension, while the second term represents kinetic 
energy.) 
3. Use Exercise 2 and the results of this section to prove a theorem about 
existence and uniqueness of classical solutions of the problem (1), (2), (3). 
4. Show that if H(f) = 0 when f is constant, then the solution (F;)50 
in Theorem 5.1 can be written in the form 


F, = 4(7,G + T_,G) + (T_,SH — T,SH). 


7 


4) 0 2 
ax u(x, t) ot u(x, t) dx 


Here again T; denotes translation, while S is the operator from #’ to #” 
defined by 
SH(u) = H(v), ucZ 


where 


v(x) = | ” t) dt + = [ v(t) dt. 
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This is the analog for distributions of formulas (13) and (13)’. 


§6. Laplace’s equation and the Dirichlet problem 


A third equation of mathematical and physical importance is Laplace’s 
equation. In two variables this is the equation 


C7u  O7u 
(1) xe + ay = 0. 


A function wu satisfying this equation is said to be harmonic. A typical 
problem connected with this equation is the Dirichlet problem for the disc: 
find a function continuous on the closed unit disc 


(a, y) |x? + y? < 1B 
and equal on the boundary to a given function g: 
ux,y)= g(x,y), wt+yal. 


A physical situation leading to this problem is the following. Let u(x, y, t) 
denote the temperature at time ¢ at the point with coordinates (x, y) on a 
metal disc of radius 1. Suppose the temperature at the edge of the disc is 
fixed, though varying from point to point, while the interior of the disc is 
insulated. Eventually thermal equilibrium will be reached: u will be inde- 
pendent of time. The resulting temperature distribution u is approximately a 
solution of (1), (2). 

To solve this equation we express u and g by polar coordinates: 


u=u(r,9), g =2g(9) 
where 
x=rcos#, y=rsin0@; r= (x? + y?)2, 6 = tan7! (yx—). 
Then 


Ou _ Cudr | UO _ og g Ou _ Sin 8 ou 
dx Orox | d0ax ~->” Oo r oo 


and similarly for du/éy. An elementary but tedious calculation gives 
au, tu _ Ou | Lau | 1 Ou 
0x? © Oy? Or? ° ror r* 06? 

Thus we want to solve 


O74 Cu 0*u 
2 ———— — ——s 
(3)  br2 | op * 562 


(4) u(1,@) = g(9), eR, 


= Q, 0<r<l, GER, 
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where u and g are periodic in 0. We proceed formally. Suppose g has Fourier 
coefficients (5,)2 ., and u,(@) = u(r, 0) has Fourier coefficients 


(a,(r))2 ws O0<r<l. 
Then (3) leads to the equation 
(5) r? Da, + rDa, — na, = 0, 0O<r<l. 


From (4) we get 


(6) a,(1) = b,, 
and since u,(@) is constant we want 
(7) a,0)=0, n#0. 


We look for a solution a,(r) of the form b,r°, where c = c(n) is a constant, 
for each n. This will be a solution if and only if 


c(c-—1)+cec-—-n?=0, 


or c? = n®. Then (7) gives c = |n|. We are led to the formal solution 
u(r, 8) = > Bar'"! exp (in8). 


Formally, this should be the convolution of g with the distribution whose 
Fourier coefficients are 


(rimlyo.. 
For r < 1 these are the Fourier coefficients of a function P, € Z In fact, 


co 


(8) P,(6) = > rirleine 


=~ © 


1+ > (re')" 4 > (re- 19)" 
1 1 
= (1 — r?)|1 — re®|-2 = (1 — r?)(1 — 2rcos 6 + r*)-}, 


Note that P,(@) is an approximate identity as r — 1: the first expression on 
the right in (8) shows 


l 2n 
az | P,(6) dd = 1 


and the last expression shows that P,(#) > 0 and 
21-6 
lim P,(0) dé = 0 
ral 6 
for each 0 < 6 < az. The function 
P(r, 0) = P,(6) 


is called the Poisson kernel for the Dirichlet problem in the unit disc. 
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Theorem 6.1. Suppose F is a periodic distribution. There is a unique 
function u(r, 6) defined and smooth in the open unit disc, satisfying (3), and 
such that the distributions defined by the functions u,(@) = u(r, 8) converge 
to F in the sense of F’ as r —> 1. Moreover, if F = F, where g € @, then u, + g 
uniformly as r —> 1. The functions u, are given by convolution with the Poisson 
kernel: 


u, = Fx P,. 


Proof. Uniqueness was proved in the derivation above. Let u, = F * P,. 
Since P, is an approximate identity, we do have u,—> F (#’) as r—>1, and 
u,—>g uniformly if F = F,, g¢@. We must show that u is smooth and 
satisfies (3). Note that when F = F,, g € @, then explicitly 


| Qn 
u(r, 8) = 5 P(r, 8 — ~)g(@) dp 
6) 


and we may differentiate under the integral sign to prove that u is smooth. 
Moreover, since P(r, 9) satisfies (3), so does uw. 

Finally, suppose u has merely a distribution F as its value on the boundary. 
Note that if0 < r,s < 1 then (by computing Fourier coefficients, for example) 


P,* Ps = Prys. 


In particular, choose any R > 0, R < 1. It suffices to show that u is smooth 
in the disc r < R and satisfies (3) there. But when r < R, 


s=rRi<1 
and 
u, = Fx P, = Fx (P,* P,) = (F * Pr) * Py = Up * Py. 
Since Pp € F uz is a smooth function of 0. Then 
ir, 8) = go | PRM, 8 — pus) de 
0<r< R. Again, differentiation under the integral sign shows that uw is 


smooth and satisfies (3). { 


The preceding theorem leads to the remarkable result that a real-valued 
harmonic function is (locally) the real part of a function defined by a con- 
vergent power series in z = x + iy. 


Theorem 6.2. Suppose u is a harmonic real-valued function of class C? 
defined in an open subset of R? containing the point (Xo, Yo). Then there is a 
function f defined by a convergent power Series: 


f{(Z) = > an(z — 2)", Iz — Z| <-e, Zo = Xo + Wo, 
0 


such that 
u(x, y) = Re f(x + iy) 
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when 
|x + ly — Zo| < é& 


Proof. Suppose first that (xo, yo) = (0, 0) and that the set in which wu is 
defined contains the closed disc x? + y? < 1. Let 


g(@) = u(cos 9, sin 6), GER. 


Then uw is the unique solution of the Dirichlet problem in the unit disc with 
g as value on the boundary. If (6,)°. are the Fourier coefficients of g, 
then we know that in polar coordinates u is given by 


> r'"lb exp (iné). 


Since u is real, g is real. Therefore b, = b*,, and the series is 


bp + 2 Re (> r"b, exp (ind)), 
1 


Let f be defined by 


f (z) = > a,Z" 
0 
where 


Ayo = bo; a, = 2b,, n> 0. 


Then 


u(x, vy) = Re (> antre)) = Re (/(re"”)) 
=Re(f(x+iy)), 2+y? <1. 


In the general case, assume that wu is defined on a set containing the closed 
disc of radius « centered at (Xo, yo), and let 


U(X, Y) = U(Xo + eX, Yo + ey). 
Then uw, is harmonic in a set containing the unit disc, so 
u, = Refi, 
Ff; defined by a power series in the unit disc. Then 


u=Ref, (x —%X)? +(¥ — Yo)? <, 
where 


f(x, y) = file“ (x — Xo), ey — Yo)) 


is defined by a power series in the disc of radius e around Zz) = X) + iyo. OQ 
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Exercises 


1. Prove the converse of Theorem 6.2: if 


f@)=Salz—2),  |z—2l <R, 


and we let 
u(x,y)= Ref(x+iy), |[x+iy— 2| < R, 


then uw is harmonic. 
2. There is a maximum principle for harmonic functions analogous to 
the maximum principle for solutions of the heat equation discussed in §4. 
(a) Show that if u is of class C? on an open set A in R? and 
Ou n 07u 5 0 
Ox? dy? 
at each point of A, then u does not have a local maximum at any point of A. 
(b) Suppose u is of class C? and harmonic in an open disc in R? and 
continuous on the closure of this disc. Show, by considering the functions 


u(x, y) = u(x, y) + ex? + ey? 


that u attains its maximum on the boundary of the disc. 

3. Use the result of Exercise 2 to give a second proof of the uniqueness 
of the solution of the Dirichlet problem for a continuous boundary function g. 

4. Suppose uw is continuous on the closed disc x? + y? < Rand harmonic 
in the open disc x? + y? < R. Give a formula for u(x, y) (or u(r, 6)) for 
x? + y? < Rin terms of the values of u for x? + y? = R. Give formulas 
for the derivatives 0u/oOr and éu/0é also. 

5. Suppose wu is defined on all of R? and is harmonic. Use the result of 
Exercise 4 to show that if uw is bounded, then it is constant. 
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Complex Analysis 


§1. Complex differentiation 


Suppose © is an open subset of the complex plane C. Recall that this 
means that for each z) € Q there is a 6 > 0 so that 2 contains the disc of 
radius 6 around Zp: 


zEQ if |z — z| < 6. 
A function f: Q — C is said to be differentiable at z € Q if the limit 
f(w) — f) 


lim ————_——— 
WoZ w—Z 


exists. If so, the limit is called the derivative of f at z and denoted /’(z). 

These definitions are formally the same as those given for functions 
defined on open subsets of R, and the proofs of the three propositions below 
are also identical to the proofs for functions of a real variable. 


Proposition 1.1. Jf f: Q-—>C is differentiable at z¢Q, then it is con- 
tinuous at Z. 


Proposition 1.2. Suppose f: Q->C, g:Q—C andaeC. If f and g are 
differentiable at z = Q, then so are the functions af, f + g, and fg: 


(af)'(z) = af"(Z) 
(f+ g)(2) =f'@ + 8) 
(fg)'(z) = f(g) + f@s'@). 


If also g(z) 4 0, then f/g is differentiable at z and 
(f/g)'(z) = Lf’ a2) — f@s'@ 1a”. 


Proposition 1.3 (Chain rule). Suppose f is differentiable at z¢C and g 
is differentiable at f(z). Then the compositive function g of is differentiable 
at z and 


(gof)'(2) = g'U@)F'@. 


The proof of the following theorem is also identical to the proof of the 
corresponding Theorem 4.4 of Chapter 2. 


Theorem 1.4. Suppose f is defined by a convergent power series: 


f(z) = > a(z—%,  |z-20| < B 
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Then f is differentiable at each point z with |z — z | < R, and 
f'(2) = 2 na,(Z _ Zo)"~*. 


In particular, the exponential and the sine and cosine functions are 
differentiable as functions defined on C, and (exp z)’ = exp z, (cos z)’ = 
—sin z, (sin Zz)’ = cos z. 

A remarkable fact about complex differentiation is that a converse of 
Theorem 2.4 is true: if fis defined in the disc |z — z,)| < Rand differentiable 
at each point of this disc, then f can be expressed as the sum of a power 
series which converges in the disc. We shall sketch one proof of this fact in 
the Exercises at the end of this section, and give a second proof in §3 and a 
third in §7 (under the additional hypothesis that the derivative is continuous). 
Here we want to give some indication why the hypothesis of differentiability 
is So much more powerful in the complex case than in the real case. Consider 
the function 


f(z) =2*, or f(xt+iy)=x—-J. 
Take te R, t 4 0. Then 


t~*[f(z + t) - f(2)] = 1, 
(it) “Lf + it) -f@)] = -1, 
so the ratio 


w*[f(w) — f()] 


depends on the direction of the line through w and z, even in the limit as 
w — z, Therefore this function fis not differentiable at any point. 
Given f: OQ — C, define functions u, v by 


u(x, y) = Re f(x + iy) = $f + iy) + 44 + iy))*, 
v(x, ») = Im f(x + iy) = 5 (f+ y))* — S/O + 0). 
Thus 
f(x + iy) = u(x, y) + iv, y). 
We shall speak of u and v as the real and imaginary parts of f and write 
f=utiv. 


(This is slightly incorrect, since f is being considered as a function of ze 
Q2 < C, while uw and v are considered as functions of two real variables x, y.) 


Theorem 1.5. Suppose Q < C is open, f: QC, f = u + iv. Then if f 
is differentiable at z = x + iy € Q, the partial derivatives 


(1 fu au eb 
Ox oy ax Oy 
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all exist at (x, y) and satisfy 


(2) 5m 9) = 5 06%, 9) 


3) 5m») = - 5 (x, 9), 


Conversely, suppose that the partial derivatives (1) all exist and are con- 
tinuous in an open set containing (x, y) and satisfy (2), (3) at (x, y). Then f is 
differentiable at z = x + iy. 


The equations (2) and (3) are called the Cauchy-Riemann equations. 
They provide a precise analytical version of the requirement that the limit 
defining f’(z) be independent of the direction of approach. Note that in the 
example f(z) = z* we have 


Proof. Suppose f is differentiable at z = x + iy. Then 
(4) lim rf + 1) — f@)] =f'@ = lim @) +f + it) - FO) 


The left side of (4) is clearly 


7) . oO 
By Or Y) + ix U(x, Y); 


while the right side is 


0 F) 
—t5, u(x, y) + By u(x, y). 


Equating the real and imaginary parts of these two expressions, we get (2) 
and (3). 

Conversely, suppose the first partial derivatives of u and v exist and are 
continuous near (x, y), and suppose (2) and (3) are true. Let 


h=a+ib 


where a and b are real and near zero, h # 0. We apply the Mean Value 
Theorem to u and v to get 


fzZ+h-f2@=fe@+a+ ib) -—fz@+a+f(z+a —- f(z) 


7) _ oO 
= Fux + ay + t,b)b + ig, Ue + ay + tab)b 


+ ue + tha Ja + i v(x + tra )a 
Ox 34, V oy 4@, Y)a, 
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where 0 < t,; < 1,7 = 1, 2, 3, 4. Because of (2) and (3), 


| Fux, ») + 15 0b io 


= — 5, ue y)b + ix Sox, y)b + a u(x, y)a + ix = ol, y)a. 
Therefore 
O rg) 
-—] _ _ ___ , 
hUfle +b) ~ f@)) ~ | Eute,») + 15 06 »)| 

is a sum of four terms similar in size to 
g (x + + t,b) — o (x, y) 
By Mo a,y l By HOY): 


Since the partial derivatives were assumed continuous, these terms —> 0 as 
h->0 | 

A function f: Q + C, Q open in C, is said to be holomorphic in Q if it is 
differentiable at each point of Q and the derivative f’ is a continuous function. 
(Actually, the derivative is necessarily continuous if it exists at each point; 
later we shall indicate how this may be proved.) Theorem 1.5 has the following 
immediate consequence. 


Corollary 1.6. f: Q—>C is holomorphic in Q if and only if its real and 
imaginary parts, u and v, are of class C* and satisfy the Cauchy-Riemann 
equations (2) and (3) at each point (x, y) such that x + ive Q. 


Locally, at least, a holomorphic function can be integrated. 


Corollary 1.7. Suppose g is holomorphic in a disc \z — Z| < R. Then 
there is a function f, holomorphic for |z — Zo| < R, such that f’ = g. 


Proof. Let u, v be the real and imaginary parts of g. We want to deter- 
mine real functions g, r such that 
f=qtir 
has derivative g. Because of Theorem 1.5 we can see that this will be true 
if and only if 


The condition that u, —v be the partial derivatives of a function g is (by 
Exercises 1 and 2 of §7, Chapter 2) 


ou _av 
Oy = ax 
The condition that v and u be the partial derivatives of a function r is 


év _ au 
dy Ox 
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Thus there are functions g, r with the desired properties. {J 


Exercises 


1. Let f(x + iy) = x? + y?. Show that fis differentiable only at z = 0. 

2. Suppose f: C — R. Show that fis differentiable at every point if and 
only if fis constant. 

3. Let f(0) = 0 and 


f(x + iy) = 2xy(x? + y?)-4, x+iy #0. 


Show that the first partial derivatives of f exist at each point and are both 
zero at x = y = 0. Show that fis not differentiable (in fact not continuous) 
at z = 0. Why does this not contradict Theorem 1.5? 

4. Suppose fis holomorphic in Q and suppose the real and imaginary 
parts u, v are of class C? in Q. Show that u and v are harmonic. 

5. Suppose f is as in Exercise 4, and suppose the disc |z — zo| < R is 
contained in ©. Use Exercise 4 together with Theorem 6.2 of Chapter 5 to 
show that there is a power series > a,(zZ — Z))" converging to f(z) for 
|Z — Z| < R. 

6. Suppose g is holomorphic in Q, and suppose © contains the disc 
|Z — Z| < R. Let f be such that f(z) = g(z) for |z — z| < R (using 
Corollary 1.7). Show that the real and imaginary parts of fare of class C?. 

7. Use the results of Exercises 5 and 6, together with Theorem 1.4, to 
prove the following theorem. 

If gis holomorphic in Q and Zy € Q, then there is a power series such that 


g(z)= > al(z— 2)", |z—Z| < R 
n=0 


for any R such that Q contains the disc of radius R with center Zp. 


§2. Complex integration 


Suppose Q < C is open. A curve in Q is, by definition, a continuous 
function y from a closed interval [a, b] < R into Q. The curve y is said to be 
smooth if it is a function of class C+ on the open interval (a, b) and if the 
one-sided derivatives exist at the endpoints: 


(t — a)~*[)(t) — y(a)] convergesas t—a,t> a; 
(t — b)~*[y(t) — y(b)] convergesas t—b,t < b. 


The curve y is said to be piecewise smooth if there are points do, a;,..., 
with 
A= <a, <°:::<a=b 
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such that the restriction of y to [a;_,, a,] is a smooth curve, 1 <j <r. An 
example is 


y(t) = Z + e exp (it), t € [0, 27]; 
then the image 
{y(t) | t € [0, 2x} 


is the circle of radius e around Zp. This is a smooth curve. A second example is 


y(t) = t, t  [0, 1], 
y(t) = 1+ it — 1), te (1, 2], 
y(t) =1+i-(t — 2), t € (2, 3], 
y(t) =i- (t — 3)i, t € (3, 4]. 


Here y is piecewise smooth and the image is a unit square. 
Suppose y: [a, b] — © is a curve and f: 2—> C. The integral of f over y, 


[4 

Y 

is defined to be the limit, as the mesh of the partition P = (fo, t,,..., ¢,) of 
[a, b] goes to zero, of 


> folie) - HD) 


Proposition 2.1. Jf y is a piecewise smooth curve in QO and f:Q—>C is 
continuous, then the integral of f over y exists and 


(1) { t= [ sooo a. 


Proof. The integral on the right exists, since the integrand is bounded 
and is continuous except possibly at finitely many points of [a, b]. To prove 
that (1) holds we assume first that y is smooth. Let P = (fo, t,,..., t,) be a 
partition of [a, b]. Then 


(2) > SOM) — 74D = > SOO — 4-11 + R, 


where 
|R| < sup [f|-> vt) — vti-2) — y(t — 4-2). 


Applying the Mean Value Theorem to the real and imaginary parts of y on 
[t;-1, t;] and using the continuity of y’, we see that 


R->0 asthe mesh |P|—0. 


On the other hand, the sum on the right side of (2) is a Riemann sum for 
the integral on the right side of (1). Thus we have shown that the limit exists 
and (1) is true. 
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If y is only piecewise smooth, then the argument above breaks down on 
intervals containing points of discontinuity of y’. However, the total contri- 
bution to the sums in (2) from such intervals is easily seen to be bounded in 
modulus by a constant times the mesh of P. Thus again the limit exists as 
|P| > 0 and (1) is true. Qj 


Note that the integral f, f depends not only on the set of points 
{y(t) | t € [a, b]} 
but also on the “sense,” or ordering, of them. For example, if 
v(t) = exp (it), y(t) = exp (—it), LE [0, 27], 


then the point sets are the same but 


Lf -lt 


Furthermore, it matters how many times the point set is traced out by y: if 


y(t) = exp (int), 


| fan] s 


A curve y: [a, b] > Q is said to be constant if y is a constant function. 
If so, then 


then 


[ r=0. all f. 


A curve y: [a, b] > Q is said to be closed if y(a) = »(b). (All the examples 
given so far have been examples of closed curves.) Two closed curves 
Yo. ¥1: [a, 6] > Q are said to be homotopic in Q if there is a continuous 
function 

I: [a, b] x [0, 1] ~ Q 


such that 


PG, 0) = volt), Dt, 1) = vi(t), all te [a, b], 
(a, s) = TQ, s), all s € [0, 1]. 


The function I is called a homotopy from yp to y;. If I is such a homotopy, 
let 


y(t)= TGs), se(O, 1). 
Then each y, is a closed curve, and we think of these as being a family of 
curves varying continuously from yp to y,, within Q. 


Theorem 2.2 (Cauchy’s Theorem). Suppose 2 < C is open and suppose 
f is holomorphic in Q. Suppose yo and y, are two piecewise smooth closed 
curves in Q which are homotopic in Q. Then 


aan 
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The importance of this theorem can scarcely be overestimated. We 
shall first cite a special case of the theorem as a Corollary, and prove the 
special case. 


Corollary 2.3. Suppose Q is either a disc 
{z||z — Z| < R} 
or a rectangle 
{x + iy | x1 < xX < Xa, V1 < y < poh. 


Suppose f is holomorphic in Q and y is any piecewise smooth closed curve 


in Q. Then 
[reo 
Y 


Proof. It is easy to see that in this case y is homotopic to a constant 
curve yo, so that the conclusion follows from Cauchy’s Theorem. However, 
let us give a different proof. By Corollary 1.7, or by the analogous result 
for a rectangle in place of a disc, there is a function h, holomorphic in Q, 
such that h’ = f. But then 


[t=] too@voa =| vro@yrod 


=| [ho yV(e) dt = WO) — Aa) = 0, 


since y(a) = y(b). J 
Proof of Cauchy’s Theorem. Let T be a homotopy from yp to y,; and let 
y(t) = T(t, s), t € [a, b], s€(0, 1). 


Assume for the moment that each curve y, is piecewise smooth. We would like 
to show that the integral of f over y, is independent of s,0 < s < 1. 

Assume first that I’ is of class C? on the square [a, b] x (0, 1), that the 
first partial derivatives are uniformly bounded, and that 


Ys as s—-O0, 
Ys yy. as sl 


uniformly on each interval (c, d) < [a, b] on which yo or yj, respectively, is 
continuous. These assumptions clearly imply that 


Fey=| f=] WON Gsa 
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is a continuous function of s, s € [0, 1]. Furthermore under these assumptions 
we may apply results of §7 of Chapter 2 and differentiate under the integral 
sign when s (0, 1) to get 


Fs) = | = ( fC, ))< r(t, =) dt 
> Q F 
~ | CE NEM) ZTGs) dt 
+ | £069) 525, TGs) at 
-| Z[rran gras a 


= fT, 8) <1, s) — AT@, 8) <1, s) 
= 0 


(since ['(b, s) = T(a, s)). Thus F(O) = F(1). 

Finally; do not assume I’ is differentiable. We may extend [ in a unique 
way so as to be periodic in the first variable with period b — a, and even and 
periodic in the second variable with period 2; then [T:R x R—Q. Let 
gy: RR be a smooth function such that 


g(x) = 0, all x, 
| o@) dx = | 
o(x)=0 if |x| = 1. 
Let 9,(x) = no(nx), n = 1, 2, 3,.... Then 
xo dx=1, (x) = Oif |x| > 174. 
Let 
P(t, 8) = | PG = x5 — vpn dea(y) dx dy, 
(t,s)€R x R. Then (see the arguments in §3 of Chapter 3) I’, is also periodic 
with the same periods as I’, and [,,->T uniformly as n— oo. It follows 


from this (and the fact that [IR x R) is a compact subset of Q) that 
r.(R x R) ¢ Q if 7 is sufficiently large. Furthermore, the argument above 


shows that 
| r-| 4 
Yo,n Yi,n 


Yat) =T,,5), tela, bj, se[0, 1). 


where 
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All we need do to complete the proof is show that 
(3) | f>|f as n>, s=Oorl. 
Ys,n Ys 


Note that we may choose the homotopy I so that y, = yp forO < s < 4 
and y, = y, for 4 < s < 1; to see this, let ') be any homotopy from yp to 
y, in Q, and let 


P(t, 5) = yo(t), 0<s <i, 
I(t, s) = T(t, 3s — 1), t<s < 4, 
P(t, s) = yi(t), Z<s< 1. 


If I’ has these properties, then when n > 3 we have 
l(t, s) = | I(t — x, s)p,(x) dx, s=0Oorl. 
It follows that y,,, > y, uniformly, s = 0 or 1. It also follows, by differen- 
tiating with respect to ¢, that y,, is uniformly bounded and 
Ys,n—> Vs 


on each interval of [0, 1] where y; is continuous, s = 0 or 1. Therefore (3) 
is true. {j 


Exercises 


1. Suppose y,: [a, b] > Q and yz: [c, d] > Q are two piecewise smooth 
curves with the same image: 


C = {y,(t) | t€ [a, b]} = {y2(t) | t€ [e, d]}. 
Suppose these curves trace out the image in the same direction, i.e., if 


s,té[a, b], s’, t’e[c, d] 


and 
yi(s) = vals’) # yi(t) = vat’) 
and 
s<t 
then 
s’<t’ 


Show that for any continuous f: Q — C (not necessarily holomorphic), 


ian 
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This justifies writing the integral as an integral over the point set C: 
| f= | soa, 
Y1 C 
where we tacitly assume a direction chosen on C. 
2. Suppose C in Exercise | is acircle of radius R and suppose | f(z)| < M. 


all ze C. Show that 


| I f(@ dz| < M-2nR. 


3. Let y(t) = 2) + ce”, t € [0, 27], where « > 0. Show that 
| (Z — Z9)7+ = 2zi. 
y 


4. Let y,(t) = exp (int), te [0, 27], nm = 0, +1, +2,.... Show that 


| z~} = 2nzi. 
y 


n 


5. Let Q = {zeC|z F 0}. Use the result of Exercise 4 to show that the 
curves y, and y,, are not homotopic in Q if n 4 m. Show that each y, is 
homotopic to yp in C, however. 


6. Use Exercise 4 to show that there is no function f, defined for all z 4 0, 
such that f’(z) = z~+, all z 4 0. Compare this to Corollary 1.7. 
7. Let Q be a disc with a point removed: 


Q = {z||z — 29] < R,z # 23}, 
where |z — z,| < R. Let yo and y, be two circles in Q enclosing z,, say 


yo(t) = 21 + ee", = te [0, 2z], 
yi(t) = z + re", t € [0, 27]. 


Here |z — z,| < r < R and e > 0 is chosen so that |y,(t) — z,| > e, all ¢. 
Construct a homotopy from yo to 74. 
8. Suppose 2 contains the square 


{x + iy|0< x,y < 1}. 


Suppose fis differentiable at each point of Q; here we do not assume that f’ 
is continuous. Let C be the boundary of the square, with the counterclockwise 
direction. Show that 


| fz) dz = 0. 


This extension of Corollary 2.3 is due to Goursat. (Hint: for each integer 
k > 0, divide the square into 4" smaller squares with edges of length 27". 
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Let Ci31,..-,; Cy.2, be the boundaries of these smaller squares, with the 
counterclockwise direction. Then show that 


fejde=>| fe az. 
I | 


j “Ck, 


It follows that if 


70 


| f(z) dz\ = M > 0, 


then for each k there is a j = j(k) such that 


f(z) dz| = 4-*M, 


Cr 
where C, = C,,;. Let z, be the center of the square with boundary C,,. 
There is a subsequence z,, of the sequence z, which converges to a point 
z of the unit square. Now derive a contradiction as follows. Since f is 
differentiable at z, 


fw) = fle) + fw — 2) + rw) 
where 
Ir(w)(w — z)"*|>0 as wz 


Therefore for each « > 0 there is a 6 > O such that if Cy is the boundary 
of a square with sides of length / lying in the disc |w — z| < 6, then 


| f(z) dz 


< eh”, 


§3. The Cauchy integral formula 


There are many approaches to the principal results of the theory of 
holomorphic functions. The most elegant approach is through Cauchy’s 
Theorem and its chief consequence, the Cauchy integral formula. We begin 
with a special case, which itself is adequate for most purposes. 


Theorem 3.1. Suppose f is holomorphic in an open set Q. Suppose C is a 
circleor rectangle contained in Q and such that all points enclosed by C are 
in Q. Then if w is enclosed by C, 


(1) fo) = 5 | Fae — w)* de 


(Here the integral is taken in the counterclockwise direction on C.) 


Proof. Let yo: [a, b] ~ Q be a piecewise smooth closed curve whose 
image is C, traced once in the counterclockwise direction. Given any positive 
e which is so small that C encloses the closed disk of radius and center w, 
let C, be the circle of radius « centered at w. 


The Cauchy integral formula 167 


We can find a piecewise smooth curve y,: [a, b] — Q which traces out C, 
once in the counterclockwise direction and is homotopic to yp in the region 
Q with the point w removed. Granting this for the moment, let us derive (1). 
By Exercise 1 of §2, and Cauchy’s Theorem applied to g(z) = f(z)(z — w)7}, 
we have 


| f@@-wta=| e=| =| foe- we, 
SO 
(2) | f@(z — w)-1dz = | fOw\(z — w)-4 dz + | Lf(w) — fz — w)-* az. 


By Exercise 2 of §2, the first integral on the right in (2) equals 27if(w). Since 
fis differentiable, the integrand in the second integral on the right is bounded 
as e—> (0. But the integration takes place over a curve of length 2ze, so this 
integral converges to zero as e —> 0. Therefore (1) is true. 

Finally, let us construct the curve y, and the homotopy. For te [a, b], 
let y(t) be the point at which C, and the line segment joining w to y,(f) 
intersect. Then for 0 < s < 1, let 


T(t, 5) = (1 — s)yo(t) + syi(t). 
It is easily checked that y, and I’ have the desired properties. [ 


The preceding proof applies to any situation in which a given curve y, is 
homotopic to all small circles around w € Q. Let us make this more precise. 
A closed curve y in an open set Q is said to enclose the point w € Q within Q. 
if the following is true: there is a 5 > O such that if 0 < e < 6 then y is 
homotopic, in 2 with w removed, to a piecewise smooth curve which traces 
out once, in a counterclockwise direction, the circle with radius e and center 


w. We have the following generalization of Theorem 3.1. 


Theorem 3.2. Suppose f is holomorphic in an open set Q. Suppose y is a 
piecewise smooth closed curve in Q which encloses a point w within Q. Then 


3) fw) = 5, | SOG - w)-* de 


Equation (3) is the Cauchy integral formula, and equation (1) is essentially 
a special case of (3). (In section 7 we shall give another proof of (1) when C 
is a circle.) 

The Cauchy integral formula makes possible a second proof of the result 
of Exercise 5, §1: any holomorphic function can be represented locally as a 
power series. 


Corollary 3.3. Suppose f is holomorphic in Q, and suppose Q contains 
the disc 


{z | |z — Zo] < R}. 
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Then there is a unique power series such that 
(4) fw) = a4 n(W — Zo)", all w with |w — zo| < R. 
Proof. Suppose 0 < r < R, and let C be the circle of radius r centered 
at Zo, with the counterclockwise direction. If |w — z)| < rand zeC then 
\(w — Zo\(z — Zo)~ | =S< 1. 
We expand (z — w)~+ in a power series: 
Z—W=Z— 2) — (W— Zo) = (Z — Zo)[l — (Ww — Zz — 2)74], 


SO 


(z — w)-+ = (z — Z)7? S (2==)" 


The sequence of functions 


gn(z) = f(z) 2, (w — 2)"(zZ — Z)7"7} 


converges uniformly for z € C to the function 


f(z)(z — w)~*. 


Therefore we may substitute in equation (1) to get (4) with 
_ ~n-1 
(5) an = 55; | See ~ 24) de, 


This argument shows that the series exists and converges for |w — zo| <r. 
The series is unique, since repeated differentiation shows that 


(6) nia, = f(Z0). 


Since r < R was arbitrary, and since the series is unique, it follows that it 
converges for all |w — z)| < R. Q 


Note that our two expressions for a, can be combined to give 
n! 
) So) = Fe, | flee = 20)" de 
Tl Jo 


This is a special case of the following generalization of the Cauchy integral 
formula. 


Corollary 3.4. Suppose f is holomorphic in an open set containing a circle 
or rectangle C and all the points enclosed by C. If w is enclosed by C then the 
nth derivative of f at w is given by 


(8) fw) = sas [. S(z\(z — w)7? dz. 
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Proof. Let C, be a circle centered at w and enclosed by C. Then by the 
Cauchy integral theorem, and the argument given in the proof of Theorem 
3.1, we may replace C by C;, in (8). But in this case the formula reduces to 
the case given in(7). [ 


A function f defined and holomorphic on the whole plane C is said to be 
entire. The following result is known as Liouville’s Theorem. 


Corollary 3.5. Jf fis an entire function which is bounded, then f is constant. 


Proof. We are assuming that there is a constant M such that | f(z)| < M, 
all ze C. It is sufficient to show that f’ = 0. Given we C and R > QO, let C 
be the circle with radius R centered at w. Then 


Fw) = 55 | fle — w)-? a, 
SO 
If'(w)| < 5 M-R-?-20R = MR-1. 


Letting R—~>0 we get f’(w) = 0. O 


A surprising consequence of Liouville’s Theorem is the “Fundamental 
Theorem of Algebra.” 


Corollary 3.6. Any nonconstant polynomial with complex coefficients has 
a complex root. 


Proof. Suppose p is such a polynomial. We may assume the leading 
coefficient is 1: 


D(z) = 2" + ay-yZ"~* +--+ + a,(z) + a. 
It is easy to show that there is an R > O such that 
(8) |p@)| = 3|z/" if |z| = R. 


Now suppose p has no roots: p(z) # 0, all ze C. Then f(z) = p(z)~* would 
be an entire function. Then f would be bounded on the disc |z| < R, and 
(8) shows that it would be bounded by 2R~" for |z| = R. But then f would be 
constant, a contradiction. 


Exercises 


1. Verify the Cauchy Integral Formula in the form (1) by direct com- 
putation when f(z) = e* and Cis a circle. 

2. Compute the power series expansion (4) in the following cases. (Hint: 
(6) is not always the simplest way to obtain the a,.) 

(a) f(z) = sin Z, Zo = 47. 

(b) e?, Zo arbitrary. 
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(c) f(z) = 22 — 2274+ 2+ 5,2 = —i. 

(d) f(zZ) = @ — Iz? + :+1:)74, Zo = 2. 

3. Derive equation (8) directly from (1) by differentiating. 

4. Suppose f is an entire function, and suppose there are constants M 
and n such that 


[f(—)| < Md + |z|)",— allzeC. 


Show that f is a polynomial of degree <n. Show that, conversely, any 
polynomial of degree <n satisfies such an inequality. 

5. The Cauchy integral formula can be extended to more general situa- 
tions, such as the case of a region bounded by more than one curve. For 
example, suppose Q contains the annulus 


A = {z|r < |z — Z| < R}, 
and also the two circles 
C, = {z||z-—2| =r}, C.={z||z-—z|=R 


which bound A. Give C, and C, the counterclockwise direction. Then if 
w € A, and fis holomorphic in Q, show that 


© fw)=75) foe - werd — 55] se - wae, 


(Hint: choose ae C, |a] = 1 so that w does not lie on the line segment 
L = {z% + ta|r < t < R} joining C, to C.. There is a curve y tracing out 
L, then C, in the counterclockwise direction, then L in the reverse direction, 
then C, in the clockwise direction. This curve is homotopic in Q to any 
small circle about w. Moreover, the integral of f(z)(z — w)~+ over y equals 
the right side of (*), since the two integrations over L are in the opposite 
directions and cancel each other.) 

6. Extend the result of Exercise 5 to the following situation: Q contains 
a circle or rectangle C, together with all points enclosed by C except 
21, Za,+-+52Zm. Let Cy, Co,..., Cy be circles around these points which do 
not intersect each other or C. If fis holomorphic in Q and w é€ Q is enclosed 
by C, then 


(00) = 353 | FO - wrt de — Xx] fee - wae 


again, all integrals are taken in the counterclockwise direction. 

7. Suppose f: 2-—>C is merely assumed to be differentiable at each 
point of the open set Q, and suppose © contains a rectangle C and all points 
enclosed by C. Suppose w is enclosed by C. Modify the argument of Exercise 
8, §2, to show that the integral in (1) remains unchanged if we replace C by 
any rectangle enclosed by C and enclosing w. Therefore show that (1) holds 
in this case also. 

8. Use the result of Exercise 7 to show that if fis differentiable at each 
point of an open set Q, the derivative is necessarily a continuous function. 
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$4. The local behavior of a holomorphic function 


In this section we investigate the qualitative behavior near a point z) € C 
of a function which is holomorphic in a disc around Zp. If fis not constant, 
then its qualitative behavior near z, is the same as that of a function of the 
form 

F(Z) = Ap + A(z — 20) 
where dp and a,, are constants, a,, # 0, and m > 1. 


Lemma 4.1. Suppose f is holomorphic in an open set Q and z)€ Q. If 
f is not constant near Zo, then 


(1) F(Z) = ayo + An(Z — 29)"h(z) 

where ay and a», are constants, m >= 1, andhis holomorphic in Q with h(z)) = 1. 
Proof. Near Zo, fis given by a power series expansion 

(2) F(Z) = a + a(Z — 29) +--+ + a,(Z — 2)" +: 

Let m be the first integer >1 such that a,, 4 0. Then (2) gives (1) with 


h(z) = > AnAm (Z — Zo)"~™. 


This function is holomorphic near Zp, and h(z)) = 1. On the other hand, 
(1) defines a function 4 in Q except at z), and the function so defined is 
holomorphic. Thus there is a single such function holomorphic through- 
outQ. G 


Our first theorem here is the Inverse Function Theorem for holomorphic 
functions. 


Theorem 4.2. Suppose f is holomorphic in an open set Q, and suppose 
Z € Q, f’(Z) 4 0. Let Wo = f(Zo). Then there is an e, > 0 and a holomorphic 
function g defined on the disc |w — Wo| < &, Such that 


g({w | |w — wo] < e,}) is open, 
f(g(v)) =w if |w — wol < ex. 


In other words, ftakes an open set containing Zp) in a 1-1 way onto a disc 
about wo, and the inverse function g is holomorphic. 


Proof. We begin by asking: Suppose the theorem were true. Can we 
derive a formula for g in terms of f? The idea is to use the Cauchy integral 
formula for g, using a curve y around w, which is the image by f of a curve 
around Z,, because then we may take advantage of the fact that g(f(z)) = z. 
To carry this out, let 6 > 0 be small enough that © contains the closed disc 
|Z — Z| < 8; later we shall further restrict 5. Let 


yo(t) = Z + de, t € [0, 27], 
y(t) = flvo(t)), 
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and let C be the circle of radius 5 around z). Assuming the truth of the 
theorem and assuming that y enclosed w,, we should have 


(wi) = = | 2(w)(w — w,)7} 
= an | - g(v(t))y’MIyvt) — wil-* at 


= yf OF sOLOs(t) — wil 
or 
| , _ 
(3) g(W,) = al of (z)[f(z) — wy] 1 dz. 


Our aim now is to use (3) to define g and show that it has the desired prop- 
erties. First, note that (1) holds with m = 1. We may restrict 6 still further, 
so that |A(z)| = 4 for ze C. This implies that f(z) 4 wo if ze C. Then we 
may choose e > 0 so that 


f(z) A Wi if [wy — Wo| <€ and ZEC. 


With this choice of 6 and e, (3) defines a function g on the disc |w; — Wo| < «. 
This function is holomorphic; in fact it may be differentiated under the 
integral sign. 

Suppose 


lz1 — Z| <8, f(z:) = wi, and [wy — Wol < «. 


We can, and shall, assume that 6 is chosen so small that f’(z) # 0 when 
|z — Z| < 6. Then 


f(z) — wy = f(z) — fla) = & — 2)k@) 


where k is holomorphic in Q and k(z,;) = f'(z,) ¥ 0. Therefore k is nonzero 
in Q. We have 


l ; _ _ 
avr) = 55 | 2f'@Ke)-Me — 21)" ae, 
Cc 
But the right side is the Cauchy integral formula for 


Zi f'(Z:)k(21)~* = 21. 
Thus g(/(z;)) = 2, for z, near Zo, and we have shown that fis 1-1 near Zo. 


Also 
l = (g of)'(Zo) = 8’(Wo)f' (Zo); 


so g'(Wo) # 0. Therefore g is 1-1 near Wo. We may take e, > 0 so small that 
€) < e, and so that g is 1-1 on the disc |w — wo| < &. We may also assume 


€, < &) SO small that 


|w — wo] < e, implies |g(w) — z| < 6 
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and 
|f(g(w)) — Wol < £0. 


But then for |w — wo| < e; we have 


s(f(g(w))) = g(), 


and since g is 1-1 on |w — wo] < eo this implies 


f(g(w)) = w. 0 


As an example, consider the logarithm. Suppose wo €C, wo 4 0. We 
know by results of §6 of Chapter 2 that there is a z) ¢ C such that e70 = wo. 
The derivative of e* at Z) is e*0 = wy # O. Therefore there is a unique way of 
choosing the logarithm 

z = log w, 


Z near Zo, in such a way that it is a holomorphic function of w near wo. 
(In fact, we know that any two determinations of log w differ by an integral 
multiple of 277; therefore the choice of log w will be holomorphic in an 
open set Q if and only if it is continuous there.) By definition a branch of 
the logarithm function in Q is a choice z = log w, we Q, such that z is a 
holomorphic function of win Q. 

As a second example, consider the nth root, n a positive integer. If w) 4 0, 
choose a branch of log w holomorphic in a disc about wo. Then if we set 


wil” = exp (- log v), 
n 
this is a holomorphic function of w near wy and 


(wt!")" = exp (nt log ") = exp (log w) = w. 


We refer to w?/" as a branch of the nth root. 
There are exactly n branches of the mth root holomorphic near wo. In 
fact, suppose 
Zo” = Wo = Z,”". 


Then for any choice of log Zp and log Z,, 


exp (n log Z)) = exp (n log 2), 
SO 
n log Z = nlog z, + 2mzi, 


some integer m. This implies 
(4) Zo = exp (log 2.) =--- = Zz, exp (2mmin~*). 


Since 
exp (2mzin~1) = exp (2m'zin~1) 


if and only if (m — m’)n~? is an integer, we get all n distinct nth roots of wo 
by letting z, be a fixed root and taking m = 0,1,...,m — 1 in (4). 
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We can now describe the behavior of a nonconstant holomorphic function 
near a point where the derivative vanishes. 


Theorem 4.3. Suppose f is holomorphic in an open set Q, and suppose 
Zo € Q. Suppose f is not constant near Zo, and let m be the first integer >1 such 
that f(z) # 0. Let Wo = f(Zo). There are « > 0 and 6 > O such that if 


0< |w— w| <e 
there are exactly m distinct points z such that 
lz — Z| <6 and f(z)=w. 
Proof. By Lemma 4.1, 
F(Z) = Wo + (Z — 2o)"A(Z), 


where h is holomorphic in Q and h(z,) # 0. Choose a branch g of the mth 
root function which is holomorphic near A(z,)). Then near Zo, 


F(Z) = Wo + [2 — Zo g(A(Z))I" = Wo + k(z)", 


where k is holomorphic near Z). Then k(Z.) = 0, k’(Zo) = g’(A(Z)) # 0. For 
z near Zo, Z # Zo, We have 


f(z) = w_ if and only if k(z) = (w — w,)!™ 


for some determination of the mth root of w — wo. We can apply Theorem 
4.2 to k: there are «, 5 so that 
k(z) =t 


has a unique solution z in the disc |z — zo| < 6 for each ¢ in the disc 
|t| < e", But if 


0 < |w — Wo| <€é 
then w — W, has exactly m mth roots ¢, all with |t| < </". Q 


The following corollary is called the open mapping property of holo- 
morphic functions. 


Corollary 4.4. If Q is open and f: Q — C is holomorphic and not constant 
near any point, then f(Q) is open. 


Proof. Suppose wy € f(Q). Then wo = f(Zo), some Zp) € Q. We want to 
show that there is an e > 0 such that the disc |w — wo| < eis contained in 
J(Q). But this follows from Theorem 4.3. [J 


Exercises 


1. Use Corollary 4.4 to prove the Maximum Modulus Theorem: if f is 
holomorphic and not constant in a disc |z — z)| < R, then g(z) = | f(z)| 
does not have a local maximum at Zp. 
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2, With fas in Exercise 1, show that g(z) = | f(z)| does not have a local 
minimum at Zz, unless f(z)) = 0. 

3. Use Exercise 2 to give another proof of the Fundamental Theorem of 
Algebra. 

4. Suppose z = log w. Show that Re z = log |w]. 

5. Suppose fis holomorphic near zy and f(z.) 4 0. Show that log | f(z)| 
is harmonic near Zp. 

6. Use Exercise 5 and the maximum principle for harmonic functions to 
give another proof of the Maximum Modulus Theorem. 

7. Use the Cauchy integral formula (for a circle with center z,) to give 
still another proof of the Maximum Modulus Theorem. 

8. Use the Maximum Modulus Theorem to prove Corollary 4.4. (Hint: 
let Wo = f(Z) and let C be a small circle around zy such that f(z) # wo if 
zeC. Choose « > 0 so that | f(z) — wo] > 2e if zEC. If |w — wo| < «, can 
(f(z) — w)~+ be holomorphic inside C? 

9. A set Q ¢ C is connected if for any points zo, z,; € Q there is a (con- 
tinuous) curve y: [a, b] > Q with y(a) = Zo, y(b) = z,. Suppose Q is open 
and connected, and suppose fis holomorphic in Q. Show that if fis identically 
zero in any nonempty open subset Q, ¢ Q, then f = 0 in Q. 

10. Let Q be the union of two disjoint open discs. Show that Q is not 
connected. 


§5. Isolated singularities 


Suppose fis a function holomorphic in an open set ©. A point z, is said 
to be an isolated singularity of f if z) ¢ Q but if every point sufficiently close 
to Z) 1s in Q. Precisely, there is a 8 > O such that 


zEQ if0< |z-—2z] < 6. 


For example, 0 is an isolated singularity for f(z) = z~", n a positive integer, 
and for g(z) = exp (1/z). On the other hand, according to the definition, 0 
is also an isolated singularity for the function f which is defined by f(z) = 1, 
z # 0 and is not defined at 0. This example shows that a singularity may 
occur through oversight: not assigning values to enough points. An isolated 
singularity zy for fis said to be a removable singularity if f can be defined at z, 
in such a way as to remain holomorphic. 


Theorem 5.1. Suppose Z, is an isolated singularity for the holomorphic 
function f. It is a removable singularity if and only if f is bounded near Zo, 
i.e., there are constants M, 6 > 0 such that 


(1) @Ql<M  if0<|z—2] <8, 


Proof. Suppose Zp, is a removable singularity. Then f has a limit at Zo, 
and it follows easily that (1) is true. 
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Conversely, suppose (1) is true. Choose r with 0 < r < 6 and let e > 0 
be such that 0 < « < r. Let C be the circle with center z,) and radius r. Given 
w with 0 < |w — z,| < r, choose « so small that 0 < e < |w — Zol, and let 
C, be the circle with center z) and radius «. By Exercise 5 of §3, 


f00) = 55] SOME ~ wrt de — | See — w)-* de 


Since fis bounded on C independent of «, the second integral goes to zero as 
e—> 0. Thus 


(2) fw) = im | SOC _wytd, 0<|w—2| <r. 


We may define f(z)) by (2) with w = Zp), and then (2) will hold for all w, 
|w — Zo| < r. The resulting function is then holomorphic. { 


An isolated singularity z) for a function f is said to be a pole of order n 
for f; where n is an integer >1, if fis of the form 


(3) F(Z) = (2 — 2)" "g(Z) 


where g is defined at Zz) and holomorphic near Zp, while g(Z)) 4 0. A pole of 
order 1 is often called a simple pole. 


Theorem 5.2. Suppose Zo is an isolated singularity for the holomorphic 
function f. It is a pole of order n if and only if the function 


(Z — 20)"f(z) 
is bounded near Zo, while the function 
(z — 2)""*f(Z) 
is not. 

Proof. It follows easily from the definition that if z) is a pole of order n 
the asserted consequences are true. 

Conversely, suppose (z — Z,)"f(z) = g(z) is bounded near Z). Then Zp is 
an isolated singularity, so we may extend g to be defined at z, and holo- 
morphic. We want to show that g(z,) 4 0 if (z — Z)"~+/(z) is not bounded 
near Zp. But if g(z)) = 0 then by Lemma 4.1, 

g(z) = ( — 2)h(z) 
for some m > 1 and some h holomorphic near Zp. But then (z — 2Z))"~4f(z) = 
(z — Z))"~*h(z) is bounded near Zz). [ 


An isolated singularity which is neither removable nor a pole (of any 
order) is called an essential singularity. Note that if zp is a pole or a removable 
singularity, then for some ae C ora = © 


f(z) >a as zZ->2Z 


This is most emphatically not true near an esseutial singularity. 
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Theorem 5.3 (Casorati-Weierstrass). Suppose Zo is an isolated singularity 
for the holomorphic function f. If Zo is an essential singularity for f, then for 
any e > Oand any aéC there is az such that 

|Z — Z| <e, | f(z) — al < «. 

Proof. Suppose the conclusion is not true. Then for some « > O and 

some a eC we have 
|f(z) — a| > e where 0 < |z — Z| <e. 


Therefore h(z) = (f(z) — a)~? is bounded near Zp. It follows that h can be 
extended so as to be defined at Zz) and holomorphic near Zp. Then for some 
m > 0, 


A(z) = (Z — 2o)"k(z) 

where k is holomorphic near Z) and k(Z,) 4 0. We have 

f(z) =a+h(z)7t' =at (z - 29) "k(z)7, 0 < |z — Z| <e. 
Therefore Zp) is either a removable singularity or a pole forf. QJ 

Actually, much more is true. Picard proved that if z) is an isolated essen- 
tial singularity for f, then for any « > 0 and any aeC, with at most one 
exception, there is a z such that 0 < |z — zo| < e and f(z) = a. An example 
is f(z) = exp (1/z), z # 0, which takes any value except zero in any disc 
around zero. 

Isolated singularities occur naturally in operations with holomorphic 
functions. Suppose, for example, that f is holomorphic in Q and z)é€Q. 


If f(z.) # 0, then we know that f(z)~* is holomorphic near z). The function 
fis said to have a zero of order n (or multiplicity n) at z), n an integer > 0, if 


f(z) = 0, O<k<n. 
I™(Zo) # 0. 


(In particular, f has a zero of order zero at Zp if f(z.) # 0.) A zero of order 
one is called a simple zero. 


Lemma 5.4. Jf fis holomorphic near zy and has a zero of order n at Zo, 
then f(z)~* has a pole of order n at Zp. 


Proof. By Lemma 4.1, 
(4) f(z) = (2 — 2)"h(Z), 


where / is holomorphic near z) and h(z.) # 0. The desired conclusion 
follows. [J 


For example, the function 
sec z = (cos z)~? 


is holomorphic except at the zeros of cos z, where it has poles (of order 1). 
The same is then true of 


tan z = sin z(cos z)~? = sin Z sec Z. 
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It is convenient to assign the “‘value”’ oo at Zz) to a holomorphic function 
with a pole at Zz. Similarly, if z) is a removable singularity for f we shall 
consider f as being extended to take the appropriate value at z). With these 
conventions we may work with the following extension of the notion of 
holomorphic function. 

Suppose Q < C is an open set. A function f: Q—>C U {oo} is said to 
be meromorphic in Q if for each point Zp € Q, either Z) is a pole of f, or fis 
holomorphic in the disc |z — z)| < 6 for some 6 > 0. 


Theorem 5.5. Suppose Q is open and f, g are meromorphic in Q. Suppose 
aéC. Then the functions 


af, fte, fg 


are meromorphic in Q. If Q is connected and g € 0 in Q, then f/g is mero- 
morphic in Q. 


Proof. We leave all but the last statement as an exercise. Suppose is 
connected, i.e. for each Zo, Zz, € 2 there is a continuous curve y: [0, 1] ~ Q 
with y(0) = 2, y(1) = 2;. Given any Z) € Q, either g(z)~* is meromorphic 
near Z) or g vanishes in a disc around Zo. We want to show that the second 
alternative implies g = 0 in all of Q. 

Suppose g vanishes identically near z) € 2 and suppose z, is any other 
point of Q. Let y be a curve joining Z) to z,: y(O) = Zo, y(1) = Z;. Let A be 
the subset of the interval [0, 1] consisting of all those t such that g vanishes 
identically near y(t). Let c = lub A. There is a sequence (¢,)? © A such that 
t, — c. Lf g did not vanish identically near y(c) then either g(y(c)) # 0, or 
y(c) is a zero of order n for some n, or y(c) is a pole of order n for some x. 
But then (see (3) and (4)) we could not have g identically zero near y(t,) for 
those 7 so large that y(t,) is very close to y(c). Thus g vanishes identically 
near c. This means that c = 1, since otherwise g vanishes identically near 
y(c + e) for small « > 0. Therefore g(z,) = g(y(1)) = g(y(c)) = 0. 

We know now that given Zy € Q, either g(z)) # 0 or Zp is either a zero or 
pole of order n for g. It follows that either g(z)~* is holomorphic near Zp, 
or, by (4), that zp is a pole of order n, or, by (3), that Z is a zero of order n. 
Thus g(z)~? is meromorphic in Q, and so f/g is also. QJ 


Exercises 


1. Prove the rest of the assertions of Theorem 5.5. 

2. Show that tan z is meromorphic on all of C, with only simple poles. 
What about (tan z)?? 

3. Determine all the functions f, meromorphic in all of C, such that 


| f(z) — tan z| < 2 


at each point z which is not a pole either of f or of tan z. 
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4. Suppose f has a pole of order n at z). Show that there are an R > 0 
and a 6 > O such that if |w| > R then there are exactly n points z such that 


f(z) = w, |Z — Zo| < 6. 


(Hint: recall Theorem 4.3.) 
5. A function fis said to be defined near 00 if there is an R > 0 such that 


{Z| |z| > R} 


is in the domain of definition of f. The function fis said to be holomorphic 
at oo if 0 is a removable singularity for the function g defined by 


g(z) = f(1/2z), 


1/z in the domain of f. Similarly, 00 is said to be a zero or pole of order n 
for fif 0 is a zero or pole of order n for g. Discuss the status of oo for the 
following functions: 


(a) f(z) = z", n an integer. 
(b) f(z) = e. 

(c) f(z) = sin z. 

(d) f(z) = tan z. 


6. Suppose Zp, is an essential singularity for f, while g is meromorphic near 
Z) and not identically zero near Zp. Is z) an essential singularity for fg? What 
if, in addition, Zz, is not an essential singularity of g? 


§6. Rational functions; Laurent expansions; residues 


A rational function is the quotient of two polynomials: 


F(z) = p(@)/q) 


where p and g are polynomials, g # 0. By Theorem 5.4, a rational function is 
meromorphic in the whole plane C. (In fact it is also meromorphic at «; 
see Exercise 5 of §5.) 

It is easy to see that sums, scalar multiples, and products of rational 
functions are rational functions. If fand g are rational functions and g # 0, 
then f/g is a rational function. In particular, any function of the form 


(1) F(Z) = ay(z — 20)7* + az — 29)7? ++++ + a(Z — Z)™" 
is a rational function with a pole at z). We can write 
(2) f(z) = p((z — 20)~*) 


where p is a polynomial with p(0) = 0. It turns out that any rational function 
is the sum of a polynomial and rational functions of the form (1). 


Theorem 6.1. Suppose f is a rational function with poles at the distinct 
points 21, Z9,...,Zm and no other poles. There are unique polynomials po, 
P15 ++ +5 Dm Such that p{O) = 0 ifj 4 0 and 
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(3) f(z) — Po(Z) + pPi((z _ Z1)~*) beet Prl(z ~ Zm)~ +). 


Proof. We induce on m. If m = 0, then fhas no poles. Thus fis an entire 
function. We have f = p/q where p and qg are polynomials and g # 0. Suppose 
p is of degree r and gq is of degree s. It is not hard to show that there is a 
constant a such that 


2s-"f(z) >a aS zZ—->0OO. 


This and Exercise 4 of §3 imply that fis a polynomial. 

Now suppose the assertion of the theorem is true for rational functions 
with m — 1 distinct poles, and suppose f has m poles. Let Zp be a pole of f, 
of order r. Then 


f(z) = (2 — 20)" *h(2); 


where /: is holomorphic near Zp. Near Zo, 


h(z) = >, an(z — 20)”. 
n=1 
Therefore 
f(z) = ao(z — 20)~" + ay(z — Zo)*7" +-°- + G,-1(Z — Zo)~* + KZ), 


where k is holomorphic near zy. Now k is the difference of two rational 
functions, hence is rational. The function g = f — k has no poles except at 
Zo, so the poles of k are the poles of f which differ from Zz. By the induction 
assumption, k has a unique expression of the desired form. Therefore f has 
an expression of the desired form. 

Finally, we want to show that the expression (3) is unique. Suppose pp 
is of degree k. The coefficient b, of z* in po can be computed by taking limits 
on both sides in (3): 


lim z~*f(z) = lim z~*po(z) = by. 
Therefore the coefficient of z*-+ can be computed: 
lim z+~*[ f(z) — b,z*] = b,-4, ete. 

Continuing in this way, we determine all coefficients of po. Similarly, if p; 
is of order r then the coefficient of the highest power of (z — z,)~* in (3) is 
lim (z — 2:)f(Z) = ¢, 

272, 
the coefficient of the next power is 


lim (z — 21)" *[f(z) — ¢(z — 21)". 


All coefficients may be computed successively in this way. ( 


The expression (3) is called the partial fractions decomposition of the 
rational function f. 
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Let us note explicitly a point implicit in the preceding proof. If f has a 
pole of order r at Zo, then 


f= > b(z-2)7", O< |z— 2 < 8, 


=-7 


some 5 > 0. This generalization of the power series expansion of a holo- 
morphic function is called a Laurent expansion. It is one case of a general 
result valid, in particular, near any isolated singularity. 


Theorem 6.2. Suppose f is holomorphic in the annulus 
A = {z|r< |z — Z| < R}. 


Then there is a unique two sided sequence (a,)~ . < C such that 
(4) f(z) = > A(Z— 2)", r< |zZ—2|< RK. 


Proof. Suppose r < |z — Zo| < R. Choose r,, R, such that 
r<7ry< |z—2| < R, < R. 


Let C, be the circle of radius r, and C, the circle of radius R, centered at Zp. 
By Exercise 5 of §3, 


A) = Fj | Fw — 20) dw — 5 | flw)Gw — 20)* 
= f2) + fi) 


Here f. and f, are defined by the respective integrals. We consider /f, as 
being defined for |z — z.| < R, and f, as being defined for |z — zo| > ry. 
Then f, is holomorphic and has the power series expansion 


(5) falz) = > a(z— 2)",  |z— zo < Ri. 


Moreover, by the Cauchy integral theorem we may increase R, without 
changing the values of f2 on |z — z,| < R,; thus the series (5) converges 
for |z — zo| < R. 

The function f, is holomorphic for |z — zo| > r,. Again, by moving the 
circle C,, we may extend f, to be holomorphic for |z — z,| > r. To get an 
appropriate series expansion we proceed as in the proof of Corollary 3.3. 
When |z — Z| > r, and |w — zo| = ry, 


(w — z)-* = ((w — 2) — (2 — 29))"* = —(@ — Zo) *[1 — (w — Zo)(Z = 20) "T° 


> — Zo)"~*(z — 2)~”. 


The series converges uniformly for w € C,, so 


(6) fz) = > a_(z — 2)-" 
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where 


= = \. S(w)(w — 20)" dw. 


Equations (5) and (6) together give (4). 
Finally, we want to prove uniqueness. Suppose (4) is valid. Then the 
power series 


an 


S a,(Z ~ Zo)" 


n=—@ 


converges for r < |z — Z| < R, so it converges uniformly on any smaller 
annulus. It follows that if C is any circle with center Zz), contained in A, then 
(3) may be integrated term by term over C. Since 


| (2 — 2)" dz 
C 
is zero for n # 1 and 2zi for n = —1, this gives 


(7) a1= 55 | fo dz. 


More generally we may multiply f by (z — z.)~"~+ and integrate to get 


(8) am = 5 | See ~ 20)? de 


all m. Thus the coefficients are uniquely determined. [ 


In particular, Theorem 6.2 applies when f has an isolated singularity at 
Zo. In this case the Laurent expansion (3) is valid for 


0 < |z— Z| < R, 


some R > 0. The coefficient a_, is called the residue of f at z). Equation (7) 
determines the residue by evaluating an integral; reversing the viewpoint 
we may evaluate the integral if we can determine the residue. These observa- 
tions are the basis for the “calculus of residues.” The following theorem is 
sufficient for many applications. 


Theorem 6.3. Suppose C is a circle or rectangle. Suppose f is holomorphic 
in an open set Q containing C and all points enclosed by C, except for isolated 
singularities at the points Z,, Z2,..., Zm enclosed by C. Suppose the residue of 
f at z; is b;. Then 


(9) f(z) dz = 2mi(b, + by +--+ + by). 


Proof. Let C;,...,C, be nonoverlapping circles centered at Z,,..., Zm 
and enclosed by C. Then 


2nib, = | f(z) dz. 
Cs 
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Applying Exercise 6 of §3, we get (9). [ 


If f has a pole at Z), the residue may be computed as follows. If n is the 
order of the pole, 


flz) = (@ — 20)""h(2) = (@ — %)-" > by(z — Zo)", 


so the residue at Zp 1s 
Bm-1 = [(m — Ith (Zz). 
In particular, at a simple pole m = 1 and the residue is A(Zp). 


Let us illustrate the use of the calculus of residues by an example. Suppose 
we want to compute 


I =| (1 + t*)"1 dt 
0 
and have forgotten that (1 + t7)~1 is the derivative of tan~+ ¢. Now 
1” _ 
1=35] (1 + ¢*)~? dt. 


Let Cp, R > 0, be the square with vertices +R and +R+ Ri. Let f(z) = 
(1 + z?)~?. The integral of f over the three sides of Cz which do not lie on 
the real axis is easily seen to approach zero as R —> oo. Therefore 


co R 
| (1 + 12)-1 dt = lim | (1 + 12)" dt 


= lim F(z) dz. 
R~o Cr 

For R > 1, fis holomorphic inside Cp except at z = i, where it has a simple 

pole. Since 


ff) = (@ - NE + i), 


the residue at 7 is (2i)~?. Therefore when R > 1, 


f(z) dz = 2i(2i)~1 = z. 
Cr 
We get I = 47 (which is tan~! (+00) — tan~10, as it should be). 

We conclude with some further remarks on evaluating integrals by this 
method. Theorem 6.3 is easily shown to be valid for other curves C, such as 
a semicircular arc together with the line segment joining its endpoints, or a 
rectangle with a portion or portions replaced by semicircular arcs. The 
method is of great utility, depending on the experience and ingenuity of the 
user. 
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Exercises 


1. Compute the partial fractions decomposition of 
z3(z2 — 3z + 2)7}, (z? + 227 + z)71. 


2. Find the Laurent expansion of exp (1/z) and sin (1/z) at 0. 
3. Compute the definite integrals 


i.8) 


| (t + 1)(t® — 2it? + t — 2i)-+ dt 
| r7(t* + 1)7* dt. 
0 


4. Show that J . t-+sin t dt = 47. (Hint: this is an even function, and 
it is the imaginary part of f(z) = z~ +e’. Integrate f over rectangles lying in 
the half-plane Im z > 0, but with the segment —e < ¢t < « replaced by a 
semicircle of radius e in the same half-plane, and let the rectangles grow long 
in proportion to their height.) 

5. Show that a rational function is holomorphic at oo or has a pole at oo. 

6. Show that any function which is meromorphic in the whole plane and 
is holomorphic at 00, or has a pole there, is a rational function. 

7. Show that if Re z > 0, the integral 


rz) =| $2-19-¢ df 
0 
exists and is a holomorphic function of z for Re z > 0. This is called the 
Gamma function. 
8. Integrate by parts to show that 


T(z + 1) = zl (2), Rez > 0. 
9. Define ['(z) for —1 < Rez < 0, z 4 0, by 
T(z) = z~+I(z + 1). 


Show that [is meromorphic for Re z > —1, with a simple pole at zero. 

10. Use the procedure of Exercise 9 to extend I’ so as to be meromorphic 
in the whole plane, with simple poles at 0, —1, —2,.... 

11. Show inductively that the residue of I at —n is (—1)"(v!)7?. 

12. Is I’ a rational function? 


§7. Holomorphic functions in the unit disc 


In this section we discuss functions holomorphic in the unit disc D = 
{z | |z| < 1} from the point of view of periodic functions and distributions. 
This point of view gives another way of deriving the basic facts about the 
local theory of holomorphic functions. It also serves to introduce certain 
spaces of holomorphic functions and of periodic distributions. 
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Note that if fis defined and holomorphic for |z — z)| < R, then setting 
fi(w) = fo + Rw) 


we get a function f; holomorphic for |w| < 1. Similarly, if f has an isolated 
singularity at 29, we may transform it to a function with an isolated singu- 
larity at zero and holomorphic elsewhere in the unit disc. Since f can be 
recovered from f, by 


f(z) = fi(R-*(Z — 20), 


all the information about local behavior can be deduced from study of fj 
instead. 
Suppose fis holomorphic in D. Then the function 


g(r, 8) = f(re™) 


is periodic as a function of 6 for 0 < r < 1 and constant for r = 0. It is also 
differentiable, and the assumption that fis holomorphic imposes a condition 
on the derivatives of g. In fact 


h-*[g(r + h, 8) — g(r, | = h-[flre® + he) — fire”) 
= e'*(he®)—1[ f(re® + he’) — f(re')]. 


Letting h —> 0 we get 
iM g(re’®) = ef'(re"®), 


Similarly, 


h-*[g(r.8 + h) — g(r, 8] = Am *[f(re*) — fire'*)] 


~ f'(re®)-h-Y[re!@+ — pet] 

SO 

Y 16 ry pO! 16 

56 elre ) = iref"(re’®). 
Combining these equations we get 

Oo y 
(1) ire = 38. 
Now let g,(@) = g(r, 9), 0 < r < 1. Since g, is continuous, periodic, and 


continuously differentiable as a function of 0, it is the sum of its Fourier 
series: 


(2) g,(0) = > a,(r)e. 
The coefficients a,(r) are given by 


2n 
a,(r) = = | g(r, Ae? dé. 
0 
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It follows that a,(r) is continuous for 0 <r <1 and differentiable for 
0 <r < 1, with derivative 


’ _ | an og ~ing 
a,(r) = an | ap (r, Ae? dd. 


Using (1) and integrating by parts we get 
(3) ar) =r-*nafr), n#0, 


and aj(r) = 0. Thus a> is constant. The equation (3) may be solved for a, 
as follows, n ~ 0. The real and imaginary parts of a, are each real solutions of 


u(r) = r-*nu(r). 


On any interval where u(r) # 0 this is equivalent to 
© log |u(r)| = d 77 oer” 


so on such an interval u(r) = cr”, c constant. Since a, is continuous on [0, 1) 
and vanishes at 0 ifn 4 0 (because gy is constant), we must have a,(r) = a,r”, 
with a, constant and a, = Oifn < 0. 
We have proved the following: if fis holomorphic in D, then 
(4) f(re®) = > arte, Osrd<l. 
n=0 
Thus 


f(z) = > Anz”, |z| < 1. 


Suppose f is holomorphic in D and defined and continuous on the 
closure: {z | |z| < 1}. Then the functions a,(r) = a,r" are also continuous 
atr = 1, anda, = a,(1) is the nth Fourier coefficient of g,. It follows that g, 
is a convolution: 


(5) 8r = Q, * S15 


where Q, is the periodic distribution with Fourier coefficients b, = r",n > 0 
and b, = 0,n < 0. Then 


O,(0) = 2, preind _ > (re'®)" 
OT 
(6) O,(8) = (1 — re’*)~*. 


Equation (5) can be written 


20 
S(re**) — ~ (1 _ re(9—b)— lf (eit) at 


= 55 Le (et — re®)-1f(e)iett dt. 
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Setting w = re’ and z = e*, we recover the Cauchy integral formula 


(7) Po) = 5 | fee — wy * de, 


Ti 
where C is the unit circle. Thus (5) may be regarded as a version of the Cauchy 
integral formula. 

Note that Q, is a smooth periodic function when 0 < r < 1. Therefore 
(5) defines a function in the disc if g, is only assumed to be a periodic distri- 
bution. In terms of the Fourier coefficients, if g, has Fourier coefficients 
(a,)~ then those of g, are (a,(r))2.. where a,(r) = 0,” < 0, a,(r) = a,r", 
n > 0. These observations and the results of §§1 and 2 of Chapter 5 leads 
to the following theorem. 


Theorem 7.1. Suppose F is a periodic distribution with Fourier coefficients 
(An) 2 wo, Where a, = O forn < 0. Let f be the function defined in the unit disc by 


(8) f(re®) = (F* Q,)(9), 
with Q, given by (6). Then f is holomorphic in the unit disc and 


(9) f(z) = > Anz", |z| < 1. 


Moreover, F is the boundary value of f, in the sense that the distributions 
F,, defined by the functions f,(@) = f(re'®) converge to F in the sense of F' as 
r—>l. 

Conversely, suppose f is holomorphic in the unit disc. Then f is given by a 
convergent power series (9). If the sequence (a,)¢ is of slow growth, i.e., if 
there are constants c,r such that 


(10) la,| < cn", n>O, 


then there is a distribution F such that (8) holds. If we require that the Fourier 
coefficients of F with negative indices vanish, then F is unique and is the boundary 
value of f in the sense above. 


Condition (10) is not necessarily satisfied by the coefficients of a power 

series (9) converging in the disc. An example is 
a, = n°, n> 0. 

Thus the condition (10) specifies a subset of the set of all holomorphic 
functions in the disc. This set of holomorphic functions is a vector space. 
Theorem 7.1 shows that this space corresponds naturally to the subspace of 
?’ consisting of distributions whose negative Fourier coefficients all vanish. 

Recall that Fe #’ is in the Hilbert space #? if and only if its Fourier 
coefficients (a,)~. satisfy 


(11) > |an|? < 0. 


For such distributions there is a result exactly like the preceding theorem. 
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Theorem 7.2. Suppose F is a periodic distribution with Fourier coefficients 
a, =O forn < 0. If FE L?, then F is the boundary value of a function f, 


(12) f(z) = > az", [2] <1 
n=0 

with 

(13) > |an|? < ©. 
n=0 


The functions f,(0) = f(re'®) converge to F in the sense of L? as r—>1, and 


VFI = sup, se | Ure) a, 


Conversely, suppose f is defined in the unit disc by (12). Suppose either that 

(13) is true or that 
20 
(14) sup | | f(re®)|2 dd < w. 
Osrsl Jo 

Then both (13) and (14) hold, and the boundary value of f is a distribution 
FeL?, 

Proof. The first part of the theorem follows from Theorem 7.1 and the 


fact that (11) is a necessary condition for F to be in L?. The second part of 
the theorem is based on the identity 


22 ve) 
(15) =| | f(re*®)|? dé = > |a,|?r2”, O0<r<il, 
T Jo n=0 


which is true because the Fourier coefficients of f, are a,r” for n => 0 and 
zero for n < O. If (13) is true then (14) follows. Conversely, if (13) is false, 
then (15) shows that the integrals in (14) will increase to 00 as r-> 1. Thus 
(13) and (14) are equivalent. By Theorem 7.1, if (13) holds then f has a 
distribution F as boundary value. The a, are the Fourier coefficients of F, 
so (13) implies Fe L?. QJ 

The set of holomorphic functions in the disc which satisfy (14) is a vector 
space which can be identified with the closed subspace of L? consisting of 
distributions whose negative Fourier coefficients are all zero. Looked at 
either way this is a Hilbert space, usually denoted by 


H? or H*(D). 


Exercises 
1. Verify that 
f(z) = > nvrzn 
n=1 


converges for |z| < 1 but that the coefficients do not satisfy (10). 
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2. Suppose f is holomorphic in the punctured disc 0 < |z| < 1. Carry 
out the analysis of this section for 


g(r, 0) = f(re") 
to deduce: 
(a) I(2) = Dn=- 0 a,z", 0< [z| < l, 
(b) If | f(z)| < M|z|~™ for some M, m, then 


foe] 


f@= > anz". 


=<—m 


Chapter 7 


The Laplace Transform 


§1. Introduction 


It is useful to be able to express a given function as a sum of functions 
of some specified type, for example as a sum of exponential functions. We 
have done this for smooth periodic functions: if ue FZ then 


(1) u(x) = > a,e™, 
where 
l 21 
— __ —inx 
(2) Qn = 5 | u(x)e dx. 


Of course the particular exponential functions which occur here are precisely 
those which are periodic (period 27). If u: R + C is a function which is not 
periodic, then there is no such natural way to single out a sequence of 
exponential functions for a representation like (1). One might suspect that 
(1) would be replaced by a continuous sum, i.e., an integral. This suspicion 
is correct. To derive an appropriate formula we start with the analogue of 
(2). Let 


(3) se) =| uten* de 


when the integral exists. (Of course it may not exist for any zeC unless 
restrictions are placed on u.) 

If we are interested in functions u defined only on the half-line [0, 0), 
we may extend such a function to be zero on (—o, 0]. Then (3) for the 
extended function is equivalent to 


(4) g(z) = } u(t)e-* dt. 
0 
If wis bounded and continuous, then the integral (4) will exist for each 
zéC which has positive real part. More generally, if ae R and e~“u(t) is 
continuous and bounded for ¢ > 0, then the integral (4) exists for Re z > a. 
Moreover, the function 


(5) g = Lu 
is holomorphic in this half plane: 


gz) = -| tu(t)e~** dt, Rez> a. 


fy 
190 
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In particular, let 


u,(t) = e”, t > 0; 
= 0, t < 0, weC. 


Then for Rez > Rew, 


Lu,(z) -| eM 2t dt = (z — w)7}, 
0 

The operator L defined by (3) or (4) and (5) assigns, to certain functions 
on R, functions holomorphic in half-planes in C. This operator is clearly 
linear. We would like to invert it: given g = Lu, find u. Let us proceed 
formally, with no attention to convergence. Since Lu is holomorphic in some 
half-plane Re z > a, it is natural to invoke the Cauchy integral formula. 
Given z with Re z > a, choose b such that 


a<b< Rez. 


Let C be the vertical line Re w = b, traced in the upward direction. We 
consider C as “enclosing” the half-plane Re w > b, though traced in the 
wrong direction. A purely formal application of the Cauchy integral formula 
then gives 


g(z) = Lu(e) = 5 oc ~ 2-1 dw 


- = [ flw)Lu,(z) dw. 


If L has an inverse, then L~? is also linear. Then we might expect to be able 
to interchange L~! and integration in the preceding expression, to get 


— 29 wt 
(6) u(t) ~ ori |. g(w)e dw, 
or 
u(t) = J | g(a + is)et* ds, 
2 J_ os 


Thus (3) or (4) and (6) are our analogues of (2) and (1) for periodic functions. 

It is convenient for applications to interpret (3) and (6) for an appropriate 
class of distributions F. Thus if Fis a continuous linear functional on a suitable 
space £ of functions, we interpret (3) as 


LF(z) = F(e.), e(t) = e7*, 


The space ¥ will be chosen in such a way that each such continuous linear 
functional F can be extended to act on all the functions e, for z in some 
half-plane Re z > a. Then the function LF will be holomorphic in this half- 
plane. We shall characterize those functions g such that g = LF for some F, 
and give an appropriate version of the inversion formula (6). 
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The operator L is called the Laplace transform. It is particularly useful 
in connection with ordinary differential equations. To see why this might be 
so, let u’ = Du be the derivative of a function uw. Substitution of u’ for u in 
(3) and integration by parts (formally) yield 


(7) [Lu’|(z) = zLu(z). 
More generally, suppose p is a polynomial 
P(Z) = AnZ™ + An iyZ™~ 1 +++ + az + Ap. 
Let p(D) denote the corresponding operator 
P(D) = a,D™" + Am—-1D™~1 +-+-+a,D + a, 


1.€., 
P(D)u(z) = agu™(x) +--+ + ayu’(x) + agu(x). 
Then formally 
(8) [Lp(D)u\(z) = p(z)Lu(z). 
Thus to solve the differential equation 
p(D)u = v 
we want 


P(z)Lu(z) = Lw(2). 
From (6), this becomes 
(9) u(t) = = I e*'p(z)~1Lv(z) dz. 


As we shall see, all these purely formal manipulations can be justified. 


Exercises 


1. Show that the inversion formula (6) is valid for the functions w,,, i.e., 
ifa > Rew and C = {z| Rez = qa} then 


] 


— _ —1pet — puwt . 
mai |. w)7 *e#* dz = e”, t>0; 


= 0, t< 0. 


2. Suppose u: [0, 00) > C is bounded and continuous and suppose the 
derivative u’ exists and is bounded and continuous on (0, 00). Show that 


[ e~*y'(t) dt = an e~ y(t) dt — u(0), 


Re z > 0. Does this conflict with (7)? 
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§2. The space # 


Recall that a function u: RC is said to be smooth if each derivative 
D*u exists and is continuous at each point of R, k = 0,1,2,.... In this 
section we shall be concerned with smooth functions u which have the 
property that each derivative of uw approaches 0 very rapidly to the right. 
To be precise, let & be the set of all smooth functions u: RC such that 
for every integer kK => 0 and every aE R, 


(1) lim e“D*u(t) = 0. 

to + 0 
This is equivalent to the requirement that for each integer k > 0, eachaeR, 
and each Me R the function 
(2) e D¥u(t) 
is bounded on the interval [M, oo). In fact (1) implies that (2) is bounded on 
every such interval. Conversely if (2) is bounded on [M, oo) then (1) holds 


when a is replaced by any smaller number a’ < a. 
It follows that if ue Y then for each k, a, M we have that 


(3) Juleaae = Sup {e*| D*u(t)| | t € [M, o0)} 


is finite. Conversely, if (3) is finite for every integer k > 0 and every aE R, 
MeR,thenucY 

The set of functions / is a vector space: it is easily checked that if u, ve F 
and be C then bu and u + v are in & Moreover, 


(4) [bu|s,a,0 = [5] |ulea,ms 
(5) [ju + vleam S |Ulecam + |Ule.a.m 


A sequence of functions (u,)? © £ is said to converge to ue £ in the 
sense of £ if for each k, a, M, 


[un — Ulkam—>O as n>, 
If so, we write 
Un —>Uu (L). 


The sequence (u,)° © # is said to be a Cauchy sequence in the sense of £ 
if for each k, a, M, 


[Un _ Un\k.a,m —> 0 as m,n-—->oO., 


As usual, a convergent sequence in this sense is a Cauchy sequence in this 
sense. The converse is also true. 


Theorem 2.1. £ is a vector space. It is complete with respect to conver- 
gence as defined by the expressions (3); i.e., if (u,)? < L is a Cauchy sequence 
in the sense of £, then there is a unique u€ £ such that u, > u (PL). 
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Proof. Let (u,)? be a Cauchy sequence in the sense of “% Taking (3) 
with a = 0, we see that each sequence of derivatives (D*u,)? is a uniform 
Cauchy sequence on each interval [M, oo). It follows, by Theorem 4.1 of 
Chapter 2, that there is a unique smooth function u such that D*u, — D*u 
uniformly on each [M, 00). Now let a be arbitrary. Since (e“ D*u,)? is also 
a uniform Cauchy sequence on [M, oo) it follows that this sequence converges 
uniformly to e“D*u. Thusue # andu, > u(f). J 


It follows immediately from the definition of & that certain operations 
on functions in # give functions in % In particular, this is true of differen- 


tiation: 
Due L ifue Ff; 


translation: 
Tue L& fue L 


where 
(Tyu)(t) = u(t — s); 
and complex conjugation: 


ure FL ifuce Ff 


where 
u*(t) = u(t)*. 


It follows that if ue &% so are the real and imaginary parts: 


Reu = (tu + u*), 
Im u = 5 (u* — Uu). 
Moreover, if ue Y so is the integral of u taken from the right: 


S,u(t) = — | ” us) ds. 


t 
In fact, DS,u = uso 


(6) |[Sauleam = [Ule-1,0,m> k>1. 


For k = 0, note that fort > M,a> 0, 


|S,.u(t)| < { |u(s)| ds < |uloam [ e~ % ds 
t t 
= a |ulo.a.me™. 
Thus 
(7) [Suloam < @*|ulog me ™, a> 0. 
The finiteness of 
|Sulo.a,m 


for a < 0 follows from finiteness for any a > 0. Thus Syue &% 
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Lemma 2.2. The operations of differentiation, translation, complex con- 
jugation, and integration are continuous from £& to £& with respect to conver- 
gence in the sense of £. Moreover, ifue £& then the difference quotient 


s-(T_wu — u)—> Du(L) 
as s—>0. 


Proof. These statements chiefly involve routine verifications. We shall 
prove the final statement. Given an integer k = 0, let v = D*u. Suppose 
t > Mand 0 < |s| < 1. The Mean Value Theorem implies 


s~*[T_ v(t) — v(t)] = Dvr) 
where |t — r| < |s|. Then 


D¥{s-+[T_ u(t) — u(t)]}} — D* Du(t) 


Do(r) — Dv(t) 
D*ty(r) — D**+u(t). 


But 
(8) |D**tu(r) — D*ttu(t)| < cla, M)e~™, t= —M. 


The left side of (8) converges to zero as s —> 0, uniformly on bounded inter- 
vals. It follows that 


ls-"[T_u — u] — Dulya.m > 0 
as s—>0,foranya’<a. {QJ 
The functions e,, 
(9) e(t) = e~*, teR, zeEC 


are not in Y for any z € C. However, they may be approximated by functions 
from £ in a suitable sense. 


Lemma 2.3. Suppose Re z > a. There is a sequence (u,)~ < L£ such that 


|un — Czle.a.m—>90 as n—->oO 
for each integer k = O and each MER. 


Proof. Choose a smooth function » such that g(t) = 1 if ¢ < 1 and 
g(t) = Oif t > 2. (The existence of such a function is proved in §8 of Chapter 
2.) Let 


pr(t) = p(t/n), Un = Pn€z- 


Then u, is smooth and vanishes for t > 2, so u, ¢ YL. We shall consider in 
detail only the (typical) case kK = 1. 


e“(Du,(t) — De,(t)) = e“(p,(t) Det) + e(t)De,(t) — De,{t)) 
= (1 — y,(t)zee"™ + Do,(tee~™. 
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Now both 1 — 9,(t) and Dg,(t) are bounded independent of ¢ and n, and 
vanish except on the interval [n, 2]. Therefore 

|e“(Du,(t) — De{t))| < c exp (na — n Rez), 
c independent of n and ¢. Thus 


|un — Czl1,a,.¢—>90 as n>, 


all M. The argument for other values of k is similar. {] 


The following lemma relates the |u|,.,.2,4 for different values of the indices 
k, a, M. 


Lemma 2.4. Suppose k, k' are integers, and 
0O<k <k’, a<a, M>M’. 


Suppose also either that k = k' or that a’ > 0. Then there is a constant c such 
that 


(10) luleam < ClUleamw,  alucL 


Proof. It is sufficient to prove (10) in all cases when two of the three 
indices are the same. The case k = k’, a = a’ is trivial. The case k = k’, 
M = M' is straightforward. Thus, suppose a = a’ > 0 and M = M’. Let 
k’ =k +jand set v = D*’u. We may obtain D*u from v by repeated inte- 


grations: 
D*u = (S,)%v. 


We use (7) repeatedly to get 


lulea.a = |D*uloam < a~‘|v|o.a,m 
— a~4| D¥'ulo.a.m 
a~*|u|x,,m- O 


If ue FY we set 
Julic = [Ulee,-e = sup {le D*u(t)| | t = —K}. 
Then the following is an easy consequence of Lemma 2.4. 
Corollary 2.5. Suppose (u,)¢ < & Then 
U, > u (FL) 
if and only if for each integer k > 0, 


|un — uly» —>O as n—->o. 


Exercises 


1. Show that u(t) = exp (—£?) is in 
2. Show that if u, ve Y then the product wv is in Z% 
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3. Show that ue YzeC implies euce LF 
4. Complete the proof of Lemma 2.2. 


§3. The space 2’ 


A linear functional on the vector space # is a function F: # ->C such 
that | 


F(au)=aF(u), Fut v) = Flu) + Flv). 
A linear functional F on & is said to be continuous if 
u,—>u (L) 
implies 
F(u,) — F(u). 


The set of all continuous linear functionals on # will be denoted #’. An 
element F ¢ 2’ will be called a distribution of type #', or simply a distribution. 
An example is the 6-distribution defined by 


(1) d(u) = u(0). 

A second class of examples is given by 

(2) Flu) = | etu(t) dt, 
0 


zeEC. 


Suppose /: R-—>C is a continuous function such that for some ae R, 
MeR, 


(3) ft)=0, t<M, 
(4) e~*f(t) is bounded. 
We may define 

F,:L—>C 
by 
(5 Fy) =| fledu(e) de 


In fact the integrand is continuous and vanishes for t < M. If a’ > a then 
on [M, oo) we have 


|f(t)u(t)| = le“ *f(t)| eS 2°t| le?*u(r)| 


clulo.a.met 9%, 


IA Il 


where c is a bound for |e~“f(t)|. Therefore the integral (5) exists and 


(6) |F;(u)| < clilowe. | efa-ant dy, 
M 
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It follows from (6) that F; is continuous on Y i.e., Fy € &’. 

We say that Fe &’ is a function, or is defined by a function, if there is a 
continuous function f/ satisfying (3) and (4), such that F = F;. 

Suppose Fe #’ is defined by f. The translates 7,f and the complex 
conjugate function f* also define distributions. It is easy to check that if 
g=T,f, 1.¢e., g(t) = f(t — s), te R, then 


Fu) = F,(T_ <u), ue L, 
Similarly, 
F,*(u) = F,(u*)*, ue Z. 


We shall define the translates and the complex conjugate of any arbitrary 
Fe #’ by 


(7) (7,.F)(u) = F(T_u), ue&% 

(8) F*(u) = F(u*)*, uc TL, 
Similarly, the real and imaginary parts of Fe #’ are defined by 
(9) Re F = 4(F + F*), 

(10) Im F = = (F* — F). 


We say that Fis real if F = F*. 
Suppose Fe ¥” is defined by f and suppose the derivative Df exists, is 
continuous, and satisfies (3) and (4). Integration by parts gives 


Fp(u) = —F(Du), uceZ. 
Therefore we shall define the derivative DF of an arbitrary Fe £’ by 
(11) DF(u) = —F(Du), uc £. 
Generally, for any integer k > 0 we define 
(12) D*F(u) = (—1)*F(D*u). 


Proposition 3.1. The set £’ is a vector space. If F € £', then the translates 
T,F, the complex conjugate F*, and the derivatives D*F are in #’. 


Proof. All these statements follow easily from the definitions and the 
continuity of the operations in # ; see Lemma 2.2. For example, D‘F: # > C 
is certainly linear. If u, > u (#), then 


D*u, > D*u (Ff), 
SO 


D*F(u,) = (—1)*F(D*u,) > (—1)*F(D"u) = D*F(u). 


Thus D*F is continuous. [ 
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A sequence (F,)? © &’ is said to converge to Fe £’ in the sense of Lf 
if for eachue & 


F,(u) > F(u) as n->o. 
We denote this by 
Fi—>F (#’). 


The operations defined above are continuous with respect to this notion of 
convergence. 


Proposition 3.2. Suppose 
F,—>F (2), G,>G (#’, 
and suppose ae C, se R. Then 


aF, > aF (£'), 
F,+G,>F+G (#’, 
T,F,—>T,.F (£'), 
D*F,, > D*F (’). 


Moreover, the difference quotient 
s[T_F — F]>F (#’) 
as s—>0. 


Proof. All except the last statement follow immediately from the defi- 
nitions. To prove the last statement we use Lemma 2.2: 


s~*[T_,F — F](u) = F(s~*[T,u — u}) 
—> F(— Du) = (DF\u). o 


The following theorem gives a very useful necessary and sufficient con- 
dition for a linear functional on ¥ to be continuous. 


Theorem 3.3. Suppose F: £ —> C is linear. Then F is continuous if and 
only if there are an integer k > 0 and constants a, M, K < R such that 


(13) [F(u)| < Kluleam  QlueZ 
Proof. Suppose (13) is true. If u,->u (F) then 
|F(un) — F(u)| < K|un — ule.a.m > 0. 


Thus F is continuous. 
To prove the converse, suppose that (13) is not true for any k, a, M, K. 
In particular, for each positive integer k we may find a v, ¢ ¥ such that 


F(vx)| = Klvele = KlUlen,-~ # 0. 
Let u,E £ be 


un = kon] 7 Oy. 
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Then 
Jule =k-*, = [F(u,)| = 1. 
But Corollary 2.5 implies that 
u,—>0 (fF). 
Since F(u,,) does not converge to 0, Fis not continuous. 


The support of a function u: R —> C is defined as the smallest closed subset 
A of R such that u(t) = 0 for every t ¢ A. Another way of phrasing this is 
that ¢ 1s not in the support of u if and only if there is an e > O such that u is 
zero on the interval (t — e, t + «). The support of u is denoted 


supp (wu). 
Condition (3) on a function f can be written 
supp (f) < [M, ~). 


The support of a distribution Fe £’ can be defined similarly. A point 
t € Ris not in the support of F if and only if there is an e > 0 such that 


F(u) = 0 
whenever ue ¥ and supp (uw) € (¢t — e, t + 2). We denote the support of F 
also by 
supp (F). 
Theorem 3.3 implies that any Fe #’ has support in a half line. 
Corollary 3.4. If Fe &’, there is MeER such that 
supp (F) < [M, 00). 
Proof. Choose k, a, M such that (13) is true for some K. If ue # and 
supp (u) < (—0, M), 


then 
[Ul i.a,m = 0 


so F(u) = 0. Therefore each t < Mis not inthe support of F. Q 


Exercises 


1, Compute the following in the case F = 38: 
T,F(u), F*(u), Re F, Im F, D*F(u). 
2. Show that if F is given by (2), then 
DF = 6 + ZF. 
3. Suppose F = D’8. For what constants k, a, M, K is (13) true? 
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4, Find supp (D*68). 
5. Prove that if F is defined by a function f, then 


supp (F) = supp (/). 


§4. Characterization of distributions of type #’ 


If f: RC is a function satisfying the two conditions (3) and (4) of §3 
and if k is an integer >0, then 


(1) D*F, 


is a distribution of type Y’. In this section we shall prove that, conversely, 
any Fe 2%’ is of the form (1) for some k and some function f. The proof 
depends on two notions: the order of a distribution and the integral (from 
the left) of a distribution. 

A distribution Fe &’ is said to be of order k if (13) of §4 is true, i.e., if 
for some real constants a, M, and K, 


(2) |F)| < Kluliam, allue& 


By Theorem 3.3, each Fe &’ is of order k for some k > 0. 
Suppose Fe L’ is defined by a function f. Let g be the integral of f from 
the left: 


ei={ _ ss)as= | fs) ds, 


where supp (/) © [M, 00). Ifue # and v = Su, then Dv = u. Integration 
by parts gives 


Fu) =| gv'dr=—-f gut) at = —F(S.0d. 


For an arbitrary Fe &’ we define the integral of F (from the left), denoted 
S_F, by 
S_F(u) = —F(S,u). 


Proposition 4.1. Jf Fe &’ then the integral S_F is in L' and 
(3) D(S_F) = F = S_(DPF). 
If F is of order k = 1, then S_F is of order k — 1. 


Proof. Clearly S_F is linear. The continuity follows from the definition 
and the fact that S, is continuous in & The identity (3) is a matter of 
manipulation: 


D(S_F)(u) = —S_F(Du) = F(S, Du) = F(u), uc L, 
and similarly for the other part. ] 
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To prove that every Fe #’ is of the form (1), we want to integrate F 
enough times to get a distribution defined by a function. To motivate this, 
we consider first a function f: R—> C such that the first and derivatives are 
continuous and such that 


supp (f) < [M, ~), 
some M. Then by integrating twice and changing the order of integration 
(see §7 of Chapter 2) we get 
t t s 
fit) = | Df(s) ds = | | D*f(r) dr ds 
t t 
= | | D*f(r) ds dr 


- | "= )DY(r) ar. 
Let i 


(4) A(t) =|t|, ¢<9, 
= 0, t > 0. 


Then our equation is 


f(t) = { h(r — t)D?f(r) dr 
or 


(5) fe) = [DAT ar, 


We would like to interpret (5) as the action of the distribution defined 
by D?f on the function 7,h; however 7;,h is not in ¥% Nevertheless, h can be 
approximated by elements of & 


Lemma 4.2. Let h be defined by (4). There is a sequence (h,)f © # such 
that 


(6) [tn — hlo.am—~>O as n->0O 
for eachaeR, MER. 


Proof. There is a smooth function p: R —> R such that 0 < 9(t) < 1 for 
all t and 9(t) = 1, t < —2, o(t) = 0, t = —1; see §8 of Chapter 2. Let 


h,(t) = p(t] n)A(t) = Pr(t )A(t). 


Then h,, is smooth, since ¢, is zero in an interval around 0 and / is smooth 
except at 0. Also h,(t) = A(t) except in the interval (—2/n, 0), and 


|h,(t) — A(t)| < 2/n,  te(—2/n, 0). 
Thus h, € ¥ and (6) is true. [J 
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Theorem 4.3. Suppose Fe £' is of order k — 2, where k is an integer 
= 2. Then there is a unique function f such that 
F = D*(F te 


Proof. Suppose first that k = 2, F is of order 0. Let h be the function 
defined by (4) and let (h,)? < £ be as in Lemma 4.2. Choose a, M,KeR 
such that 


(7) [F(w)| < Kluloam, alluweL. 
We may suppose a > 0. For each s € R the translates T,h, also converge toh: 
(8) [Tstn — Tshlo.ou—>0 as n—>0o, 


It follows from (7) and (8) that for each se R 
F(T,h,) converges as n-—>0oo. 


Let f(s) be the limit of this sequence. Then 


9) M(8)| < lim K|TPylo,en = K\Tetlovae 
But 

(10) [T:hlo.a.m = 0 ifs < M, 
(11) l[Tshlo.am < (s — M)e* ifs > M. 
Thus 


supp (f) < [M, 0) 
and for any a’ > a there is a constant c such that 
|f(s)| < ce”, all seR. 


If f is continuous, it follows that f defines a distribution F, ¢ Z'. Suppose 
s < t. Taking limits we get 


If@) — f(s)| < KITA -— TAlo.am 
< Ke*(t — s). 


Thus fis continuous. . 
Let f,(s) = F(T,h,). Then f,(s) = O ifs < M. Fors > M, 
ACD) ~ f(s)| Ss K\T hy -_ Tsh\o,a,m 
< 2Ke*(s — M)/n. 
Therefore if ue Y 
F(u) = | fls)u(s) ds = lim G,(u) 
where G,, is the distribution defined by f,. Then 


D°G,(u) = G,(D2u) = { F(T,h,) D?u(s) ds. 
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Let v, be the function defined by 
v,(t) = | T,h,(t)D?u(s) ds 
= | h,(t — s)D?u(s) ds. 


Since D?ue LZ andh,(t — s) = Oifs < t, the integral converges. Moreover, 
it is not difficult to see that the integral is the limit of its Riemann sums 


N2 


taut) = yD, , halt ~ mlN)D*UCrIN), 


in the sense that 
lUn.w — Unloaam—>O as Noo, 


In fact, fort > M, 


min 


> | [h,(t — m/N)D?u(m/N) — h(t — s)D?u(s)] ds 


m=-1)/n 


[Un,w(t) — 0,(t)| = 


& * —_ -as _ Cc —at 
<q lr-sle ds we’? 
where c and c’ are independent of t and N. Therefore 
F(v,) = lim F(,,y) 
N 


= lim 5 D, Palm|N) Dum) 


- { f(s) D?u(s) ds = D®G,(u). 
Now let 
“(t) = | h(t — s)D°u(s) ds. 
Then 
lUn — Viloam—>9 as noo. 


In fact, fort > M 
lose) — v()| < | [aalt — 8) — AC — 5)] |D2u(5)] ds 
t 
Qin 
< | |D°u(s + 1)| ds < ce-**/n. 
0 
Therefore 


D?F(u) = F,(D?u) = lim G,(D7u) 
= lim F(v,) = F(v). 
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But 


r(t) = | ” (§ — t)D°u(s) ds 


— | ” Du(s) ds = u(t). 


Thus D2F, = F. 


Now suppose Fis of order k — 2 > 0. Let G = (S_)*~2F. Then G is of 
order 0, and by what we have just shown, there is an f such that 


D*F, = G. 
But then 
D*F, => D*-2G = F. 


Finally, we must prove uniqueness. This is equivalent to showing that 
DF = Oimplies F = 0. But F = S_(DF), so this is the case. {J 


Exercises 


1. Show that D*6 is of order k but not of order k — 1. 
2. Find the function f of Theorem 4.3 when F = 6. Compute DF; = S_6. 


3. Show that if supp (Ff) ¢ [M, oo), then supp (S_F) < [M, 00), and 
conversely. 


4. Suppose Fe 2%’ and the support of F consists of the single point 0. 
Show that F is of the form 


Ss a,,D*65, 
k=0 


where the a,,’s are constants. 


§5. Laplace transforms of functions 


Suppose that fis a function which defines a distribution of type &%’, i.e., 
f: R—C is continuous, and 


(1) supp (f) < [M, 0), 
(2) [f(t)| < Ke“, ~— allt. 
If z EC, let e, be as before: 

e(t) = e~*, teR. 
If Rez = b > a then 

|F(t)e(t)| < Kee~™, 
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Since f(t) = 0 for t < M, the integral 
(3) [feed ae = [fe a 


exists when Re z > a. The Laplace transform of the function f is the function 
Lf defined by (3): 


(4) Lf(z) = | e-*4(t) dt,  Rez>a. 
Theorem 5.1. Suppose f: R— C is continuous and satisfies (1) and (2). 
Then the Laplace transform Lf is holomorphic in the half plane 
{z| Rez > a}. 


The derivative is 


(5 (Ify@) =| e-*ef(e) de 

The Laplace transform satisfies the estimate 

(6) |Lf(z)| < K(Rez — a)~1exp(M(a — Rez)), Rez>a. 
Proof 


Ov = 2)*ELf0) — LF = felon, 2, 2/0) a 


where 
g(w, Zz, t) = (w — z)74fe7 — e- #4], 
Suppose Re z and Re w are => b > a. Let 
h(s) = exp [—(1 — s)z — swt, O<s<l. 
Then 
g(w, Zz, t) = (w — z)~*[A(1) — AO)]. 


An application of the Mean Value Theorem to the real and imaginary parts 
of h shows that 


[h(1) — h(0)| < clw— zie", =t20 
where 5b’ = bif b > 0, b’ = max {Re w, Re z} otherwise. Thus as w — z, 
| g(w, z, t)f(t)| < cie~*, where «> 0. 
Moreover, 
E(w, 2, t) f(t) > —te-*f(t) 


as w—>z, uniformly on each interval [M, N]. It follows that Lfis differen- 
tiable and that (5) is true. 
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The estimate (6) follows easily from (1) and (2): 
IL@)| < | |f(@en*| at 
M 
< K| exp t(a — Re z) dt 
/M 


= K(Rez — a)" exp (M(a — Rez)). O 
We want next to invert the process: determine f, given Lf. 


Theorem 5.2. Suppose f satisfies the conditions of Theorem 5.1, and let 
g = Lf. Given b > max {a, 0}, let C be the line 


{z| Rez = 5}. 
Then f is the second derivative of the function F defined by 


(7) F(t) = = | ez -29(z) dz. 


Proof. By (6), g is bounded on the line C. Therefore the integral (7) 
exists. Moreover, if 


sve) =| en *f(t) at 


-N 


then the gy are bounded uniformly on the line C and converge uniformly to 
g. Thus F(t) is the limit as N —> 00 of Fy, where 


1 
Fyt) = 5 | e*tz-29.(z) dz 
C 


— 1 Zt~—2 " ~— 28 
=a |e Z i f(s) ds dz 


— “ I 2(t—s).-2 
=|- mil ° Z dz f(s) ds, 


Let us consider the integral in braces. When s > ¢ the integrand is holo- 
morphic to the right of C and has modulus < k|z|~? for some constant k. 
Let Cz be the curve consisting of the segment {Re z = b| |z — b| < N} and 
the semicircle {Re z > b | |z — b| = N}. Then the integral of 


over Cp in the counterclockwise direction is zero, and the limit as R —> 0 is 


Cc 
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Thus the integral in braces vanishes for s > t. When s < ft, let Cz be the 
reflection of C, about the line C. Then for R > 5, 


I d 


=f-—sS. 
Taking the limit as R — 00 we get 


_ 


Tai e*t-s)7-2 dz = t — §, s<t. 
vin 
Cc 


Thus . 
F(t) = | (t — s)f(s) ds. 


It follows that DPF =f. Q 
We can get a partial converse of Theorem 5.1. 
Theorem 5.3. Suppose g is holomorphic in the half plane 
{z| Rez > a} 


and satisfies the inequality 


(8) |g(z)| < K(1 + [z|)~? exp (—M Re z). 

Then there is a unique continuous function f: R — C with the properties 
(9) supp f < [M’, 0), some M’, 

(10) |f(t)| < Ke", all t, for some b, 

(11) Lf(z) = g(z) for Rez>b. 


Moreover, we may take M' = M in (9) and any b > a in(10) and (11). 
Proof. Choose b > a and let C(b) be the line {z | Re z = 5}. Let 


(12) f(t) = mi |. e*!g(z) dz. 


It follows from (8) that the integral exists and defines a continuous function. 
It follows from (8) and an elementary contour integration argument that (12) 
is independent of b, provided b > a. Moreover, (8) gives the estimate 


(13) [fi] < Cet-, = > a, 
where C is independent of 5 and t. This implies (10). If t < M we may take 
b —> +00 in (13) and get 
f(t) =0,t < M. 
Thus the Laplace transform of f can be defined for Re z > a. If Rew > a, 


choose 
a<b< Rew. 
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Then 


Lf) = {. eels | oO az\ dt. 


Since 
le~ “+ 2to(z)| < c(1 + |z|)"2 exp [—Mb + t(b — Rew)] 


for t¢ R and ze C, we may interchange the order of integration. This gives 


Lf(w) = - \. ' ge) | e@— wit ar} dz 
—] 


= Tai g(zje@-Y™(z —_ w)7} dz. 
Cb) 
In the half plane Re z > 5 we have 
| g(zje@-™| < cl + |z|)7?. 


Therefore a contour integration argument and the Cauchy integral formula 
give 
Lf(w) = g(z)eer PM | aw 
= g(w). 


Finally, we must show that fis unique. This is equivalent to showing that 
Lf = 0 implies f = 0. But this follows from Theorem 5.2. [ 


Exercises 


1. Let f(t) = 0, t < 0; f(t) = te*, t > 0. Compute Lf and verify that 


ft) = 55, | e*'L f(z) dz, 


where C is an appropriate line. 
2. Show that the Laplace transform of the translate of a function f 
satisfies 
L(T;f)(z) = e*Lf(z). 
3. Suppose both fand Df are functions satisfying (1) and (2). Show that 
L(Df)(z) = zLf(z). 
4. Compute Lf when 
fH=0, ¢<0; f@=t, t>0, 


where 7 is a positive integer. 
5. Suppose / satisfies (1) and (2), and let g = e,, f. Show that 


Lg(z) = Lf(z — w). 
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6. Compute Lf when 
f(t) = 0, t<0; f(t) = et", t> 0. 
7. Compute Lf when f(t) = 0, t < 0; 


t 
f(t) =| sinsds, 1>0. 


0 


$6. Laplace transforms of distributions 
Suppose Fe #’. Theorem 3.3 states that there are constants k, a, M, K 
such that 
(1) |F(u)| < Klul,am,  allue®% 


If Re z > a, then Lemma 2.3 states that there is a sequence (u,)? © # such 
that 


(2) \Un — Czle.om—>0 as no, 


where e.(t) = e~*. Now (1) and (2) imply that (F(u,))?-1 is a Cauchy 
sequence in C. We shall define the Laplace transform LF by 


(3) LF(z) = lim F(u,). 


In view of (2) we shall write, symbolically, 
(4) LF(z) = F(e,) 
even though e, ¢ &. Note that if (v,)P < # and 


lUn — Czlk.am—>0O aS n—>©OO 
then 
|F(v,) — Fu,)| < Kl0n — Unlx,a,m > 0 
as n—>oo. Thus LF(z) is independent of the particular sequence used to 
approximate e,. 


Proposition 6.1. Suppose F,Ge Y' and beEC€. Then on the common 
domain of definition 


(5) L(OF) = bLF; 

(6) L(F + G)=LF+1G; 

(7) L(T,F)(z) = e~*LF(z); 

(8) L(D*F)(z) = z*LF(z); 

(9) L(S_F)(z) = z7*LF(z), z #0. 
Moreover, 


(10) L(eyF)(z) = LF + w), 
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where e,,F is the distribution defined by 


(11) é,f(u) = Fle,u), ucE&hH 
If F is defined by a function f, then 
(12) LF = Lf. 


Proof. The identities (5) and (6) follow immediately from the definitions. 
If (u,)° < £ satisfies (2) then the sequence (T_,u,) approximates 7T_,e, in 
the same sense. But 


T_,€, = e *e,, 
SO 


L(T,F)(z) = lim T,F(u,) = lim F(T_,u,) 


= e~* lim F(u,) = e~*LF(z). 


This proves (7), and the proofs of (8), (9) and (10) are similar. Note that 
ue £ implies e,ue £ and 


| 


u,—>u (L) 
implies 
Cyn —> Cu (L). 


Therefore (11) does define a distribution. 
Finally, (12) follows from the definitions. [J 


We can now generalize Theorems 5.1 and 5.2 to distributions. 


Theorem 6.2. Suppose Fe £' and suppose F satisfies (1). Then the 
Laplace transform LF is holomorphic in the half plane 


{z| Rez > a}. 
Moreover, 
(13) F = D*t3F., 
where f is the function defined by 


(14) f(t) = Al e%z—k- 27 F(z) dz. 


Here C is the line {z | Re z = b}, where b > max {a, 0}. 
Proof. We know by Theorem 4.3 that there is a function f such that 
F = D**°F,, 


It was shown in the proof of Theorem 4.3 that for any b > max {a, 0} there 
is a constant c such that 


| f(t)| < ce™, all t. 
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Therefore Lf is holomorphic for Re z > max {a, 0}. By Proposition 6.1, 
(15) LF(z) = z**Lf(z), Re z > max {a, 0}. 


Therefore LF is holomorphic in this half plane. This completes the proof 
of the first statement in the case a => 0. When a < 0, let G = e,F. Then (1) 
implies 

|Gu)| < K|ule,ou- 


Thus by the argument just given, LG is holomorphic for Rez > 0. Since 
LF(z) = LG(z — a), LF is holomorphic for Re z > a. 

Now let C be the line {z | Re z = b}, where b > max {a, 0}. Let f be the 
function such that (13) is true. Then by Theorem 5.2 and equation (15), fis 
the second derivative of the function 


(16) g(t) = 7 | ettz-k-47 F(z) daz, 


From the definition of LF it follows that on C 
(17) ILF(2)| < Kleelna,ae = K|z|"eo™-™™, 


Using (17) we may justify differentiating (16) twice under the integral sign 
to get (14). JO 


Theorem 6.2 implies, in particular, that if LF = 0 then F = 0. 
Given a holomorphic function g, how can one tell whether it is the 
Laplace transform of a distribution? 


Theorem 6.3. Suppose g is holomorphic in a half plane 
{z| Rez > a}. 


Then g is the Laplace transform of a distribution Fe &' if and only if there 
are constants k, a, M, K, such that 


(18) |g(z)| < K,(1 + |z|)* exp (—M Rez), Rez > a. 


Proof. Suppose g = LF, where Fe #’. Then there are k, a, M, K such 
that (1) is true. Then Re z > a implies 


|LF(z)| < Klezli.am < Ki(1 + |z|)* exp (—M Re 2), 


where K, = Ke™., 
Conversely, suppose (18) is true. Take b > max {a, 0}. We may apply 
Theorem 5.3 to 


h(z) = z~*~?g(z) 
to conclude that 


h = Lf, 
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where / is continuous, 


supp f © [M, 0), 
[f(@)| < ce”. 


Let F = D*+?f, Then 
LF(z) = 2***h(z) = g(z), Rez > b. 


Since this is true whenever b > max {a, 0}, the proof is complete in the case 
a> 0. 
When a < QO, let 


8i(2) = g(z + a). 
Then g, is holomorphic for Re z > 0 and satisfies 
|g3(z)| < K.(1 + |z|)* exp (— MM Re z). 
It follows that gz, = LF, for Rez > 0, some F, € #’. Then 
g = LF, F = e_,F;. O 


Exercises 


1. Compute the Laplace transforms of D*6, k = 0,1,2,... and of 7,8, 
seER. 
2. Compute the Laplace transform of F when 


F(u) = | : e“'y(t) dt. 


0 


§7. Differential equations 


In §§5, 6 of Chapter 2 we discussed differential equations of the form 
u'(x) + au(x) = f(x), 
and of the form 
u"(x) + bu'(x) + cu(x) = f(x). 


In this section we turn to the theory and practice of solving general nth order 
linear differential equations with constant coefficients: 


(1) a,u™ + a,yu"-Y) +.---au' + au =f, 
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where the a; are complex constants. Using D to denote differentiation, and 
understanding D° to be the identity operator, D°u = u, we may write (1) in 
the form 


(1)’ 2, a,,D*u = 
Let p be the polynomial 
p(z) = > ay,2". 
k=0 


Then it is natural to denote by p(D) the operator 


(2) p(D) = Ss a,,D*. 
k=0 
Equation (1) becomes 
(1)" p(D)yu = f. 
We shall assume that the polynomial is actually of degree n, that is 
a, # 0. 


Before discussing (1)” for functions, let us look at the corresponding 
problem for distributions: given H € £’, find Fe &’ such that 


p(D)F = H. 


Theorem 7.1. Suppose p is a polynomial of degree n > 0, and suppose 
He &'. Then there is a unique distribution F €« £' such that 


(3) p(D)F = H. 


Proof. Distributions in #’ are uniquely determined by their Laplace 
transforms. Therefore (3) is equivalent to 


(4) L(p(D)F)\(z) = LH(z), Rez>a 
for some ae R. But 
L(p(D)F)(z) = p(z)LF(z). 


We may choose a so large that p(z) # Oif Re z = a, and so that LA is holo- 
morphic for Re z > a and satisfies the estimate given in Theorem 6.3. Then 
we may define 


g(z) = pz) *LH(z), Rez>a. 
Then g is holomorphic, and it too satisfies estimates 
|2g(z)| < K(1 + |z|)* exp(—M Rez), Rez >a. 


Theorem 6.3 assures us that there is a unique F ¢ #’ such that LF = g, and 
then (4) holds. { 
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The proof just given provides us, in principle, with a way to calculate F, 
given H. Let us carry out the calculation formally, treating F and H as 
though they were functions: 


F(t) = =a | e®I_ F(z) dz 
Cc 


— i tz -1 
= 5 [ e*p(z)~ *LH(z) dz 


= + | | e’n(z)~ te * H(s) ds dz 
2nidoJr 


“= | EE I et 92p(7) 1 ae H(s) ds 
=| G(r - s)H(s) ds 

where 

(5) G(t) = a | ep(z)~1 dz 


and Cis a line Rez = b> a. 


We emphasize that the calculation was purely formal. Nevertheless the 
integral (5) makes sense if p has degree >2, and defines a function G. 
Equivalently, 


, —_ 1; I tz ~il 
(5) G(t) = lim = \. e*p(2)-} dz, 


where C; is the directed line segment from b — iR to b + iR, R > 0. We 
shall show that the limit (5)’ also exists when p has degree 1, except when 
t = 0. The function defined by (5)’ is called the Green’s function for the 
operator p(D) defined by the polynomial p. Our formal calculation suggests 
that G plays a central role in solving differential equations. The following two 
theorems provide some information about it. 


Theorem 7.2. Suppose p is a polynomial of degree n= 1; suppose 
Z1, Z9,..., 2, are the distinct roots of p, and suppose that z; has multiplicity m,. 
Then (5)' defines a function G for allt # 0. This function is a linear combination 
of the functions g;,,1 <j <1r,0< k < mj, where 


S(t) = 0, [< 0; 
ByA(t) = t* exp (zt), t> 0. 


Proof. Suppose t < 0. Let Dz denote the rectangle with vertices b + iR, 
(6 + R*?) + iR. When Rez = J, 


(6) le“p(z)~*| < e(t)(1 + |2|)~" exp ¢(Re z — 5), 
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where c(t) is independent of z. Now the line segment Cy is one side of the 
rectangle Dz, and the estimates (6) show that the integral of e*p(z)~+ over 
the other sides converges to 0 as R-> oo. On the other hand, the integral 
over all of Dp, vanishes, because the integrand is holomorphic inside Dp. 
Thus the limit in (5)’ exists and is 0 when ¢ < 0 (and also when ¢ = 0, if 
n> 1). 

Suppose ¢ > 0. Let Dp now be the rectangle with vertices b + iR, 
(b — R'?) + iR, with the counterclockwise direction, and suppose R is so 
large that D, contains all roots of p(z). Then the integral of e#p(z)~* over 
Dp is independent of R, and the integral over the sides other than C, tends 
to 0 as R-> oo. Thus again the limit in (5)’ exists, and 


” _ i at -1 
(5) Gt) = 5 \.. ety(z)-1dz, 1t>0. 


Now we may apply Theorem 6.3 of Chapter 6: G(t) is the sum of the residues 
of the meromorphic function e*p(z)~*. The point z, is a pole of order m,, 
SO near z, we have a Laurent expansion 


p(Z)-*= > ba(z — 2,)". 


m2—~ My 


Combining this with 


et = et > (m!)-Mz — 2)", 


me20 


we see that the residue (the coefficient of (z — z,)~1 in the Laurent expansion) 
at z, is a linear combination of 


t®exp(zt), O<sk<m;; 
moreover, it is the same linear combination whatever the value of ¢. {J 


Suppose f is a complex-valued function defined on an interval (a, 5). 
We write 


f(a+) = lim f(a) 


when the limit on the right exists as t approaches from the right. 
We take the Green’s function for p(D) to be 0 at t = 0; whenz > 1 this 
agrees with (5)’. 


Theorem 7.3. Let p be a polynomial of degree n > 0, with leading co- 
efficient a, # 0. Let G be the Green’s function for p(D). Then G is the unique 
function from R to C having the following properties: 


(7) all derivatives D*G exist and are continuous when t # 0; 

(8) the derivatives D*G exist and are continuous at 0 when k < n — 2; 
(9) G(t) = 0, t< 0; 

(10) p(D)G(t) = 0, t> 0; 

(11) a,D"-*G(0+) = 1. 
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Proof. We know that G is a linear combination of functions satisfying 
(7) and (9), so G does also. When t > 0 we may differentiate (5)” and get 


(12) D*G(t) = 5 | zketty(z)-1 de, 
DR 


Thus 


D*'G(0+) = | z"p(z)~+ dz. 
Dr 


We may replace Dp by a very large circle centered at the origin and conclude 
that 


D'GO0+)=0, k<n—2. 


Therefore (8) is true. Let us apply the same argument when k =n — 1. 
Over the large circle the integrand is close to 


2"-Wayz,)"* = a2, 
SO 

D*-*G(0+) = a,7}. 
Finally, (12) gives 


— I 2t —_- 
PD)G(t) = 5 \. etdz=0, 1t>0. 


Now we must show that G is uniquely determined by the properties 
(7)(11). Suppose G, also satisfies (7)-(11), and let f= G — G,. Then f 
satisfies (7)-(10); moreover D"~+/(0) = 0. We may factor 


P(z) = a,(Z — 21)(Z — Z2)+++(Z — 2), 
where we do not assume that the z, are distinct. Let fg = f, and let 
z= Dfe-1 — 2rfi-1 k>0. 
Then each f, is a linear combination of Df, 0 < j < k, so 


f(0)=0, ken—-1. 


Moreover, 
Sn = (D _- Z,)(D — Zn-1)° ° -(D — Zi)f 
= d,~*p(D)f = 0. 
Thus 
Fn-1(0) = 0, Dfn-1 - ZnJn-1 =fr = 0, 
sO 
Sa-1 = 0. 
Then 


Fn-2(0) = 0, Df, -2 _ Zn-1Jn-2 = 0, 
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so f,-2 = 0. (We are using Theorem 5.1 of Chapter 2). Inductively, each 
fh=0k <n,sof=OandG=G,. J 


Now let us return to differential equations for functions. 


Theorem 7.4. Suppose p is a polynomial of degree n > 0, and suppose 
Ff: [0, 00) ~ C is a continuous function. Then there is a unique solution 
u: [0, 00) — C to the problem 


(13) P(D)u(t) = ft), = t > 0; 
(14) D’u(0+) = 0, O<j<n-l. 
This solution u is given by 


(15) u(t) = { G(t — s)fts) ds, 


where G is the Green’s function for the operator p(D). 


Proof. We use properties (7)-(11) of G. Let u be given by (15) for t > 0. 
Then successive differentiations yield 


(16) Du(t) = G(0+)f(t) + [ DG(t — s)f(s) ds 


_ [ DG(t — s)f(s) ds,..., 


(17) D'u(t) = [ “DkG(t — s)f(s)ds, ke <n—1, 
(18) D'u(t) = a,~*f(t) + [ peat — s)f(s) ds. 
Thus 


pDu(t) = fl0) + | pCD)G(e ~ s fle) 
= f(t) 


Moreover, (17) implies (14). Thus uw is a solution. The uniqueness of u is 
proved in the same way as uniqueness of G. [J 


We conclude with a number of remarks. 


1. The problem (13)-(14) as a problem for distributions: Let us define 
f(t) = 0 for t < 0. If f does not grow too fast, i.e., if for some ae R 


e~“f(t) is bounded, 
then we may define a distribution H € #’ by 


H(v) = | fv) dt, ve &. 


Differential equations 219 


Suppose uw is the solution of (13)-(14). Then it can be shown that u defines a 
distribution F, and 


(19) P(D)F = H. 
Thus we have returned to the case of Theorem 7.1. 


2. If the problem (13)-(14) is reduced to (19), then the proof of Theorem 
7.1 shows that the solution may be found by determining its Laplace trans- 
form. Since there are extensive tables of Laplace transforms, this is of 
practical as well as theoretical interest. It should be noted that Laplace 
transform tables list functions which are considered to be defined only for 
t = 0; then the Laplace transform of such a function fis taken to be 


fo) 


Lf(z) = [ e- f(t) dt. 


0 


In the context of this chapter, this amounts to setting f(t) = 0 for t < 0 and 
considering the distribution determined by f, exactly as in Remark 1. 


3. Let us consider an example of the situation described in Remark 2. 
A table of Laplace transforms may read, in part, 


f Lf 
sin t (z7 + 1)7} 
sinh f¢ (z? — 1)74 


(As noted in Remark 2, the function sin ¢ in the table is considered only for 
t > 0, or is extended to vanish for t < 0.) 
Now suppose we wish to solve: 


(20) u"(t) — u(t) — sint = 0, t>0; 
(21) u(0) = u’(0) = 0. 
Let p(z) = z* — 1. Our problem is 


p(D)u = sin t, t>0; 
u(0) = Du(O) = 0. 


The solution u is the function whose Laplace transform is 
p(z)~+L(sin t)(z) = (2? — 1)73(z? + 1)74. 
But 
(z2 — 1)-1(z? + 1)72 = Az? — 127? — 42? + «YY. 
Therefore the solution to (20)-(21) is 
u(t) = 4sinh¢t — 4sin ft, t> 0. 
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4. In cases where the above method fails, either because the given 
function f grows too fast to have a Laplace transform or because the function 
Lu cannot be located in a table, one may wish to compute the Green’s 
function G and use (15). The Green’s function may be computed explicitly 
if the roots of the polynomial p are known (of course (5)” gives us G in 
principle). In fact, suppose the roots are 2, Zz,...,Z, With multiplicities 
M1, Mo,..., mM, We know that G for t > 0, is a linear combination of the n 
functions 


t*® exp (z,t), k < m,. 
Thus 
G(t) = > Cyt" exp (z,t), t > 0, 


where we must determine the constants c,,. The conditions (8) and (11) give 
n independent linear equations for these n constants. In fact, 


G0+) = > eo, 
DG(O+) = > Z4Cjq + > Cis 


etc. 

5. The more general problem 
(22) P(D)u(t) = f(t), — t > 0; 
(23) D'u(0+) = by; O<k<n 


may be reduced to (13)-(14). Two ways of doing this are given in the exercises. 
6. The formal calculation after Theorem 7.1 led to a formula 


F(t) = | G(t — s)H(s) ds 


which it is natural to interpret as a convolution (see Chapter 3). A brief 
sketch of such a development is given in the exercises. 


Exercises 


1. Compute the Green’s function for the operator p(D) in each of the 
following cases: 


P(z) = 22 — 4z — 5 
P(z) = z? — 42 + 4 
p(z) = 22 + 2227 —z—2 
P(z) = 2? — 3z + 2. 
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2. Solve for u: 


u"(t) — 4u’(t) + 4u(t) = et, t> 0, 
u(0+) = uw’(0O+) = 0. 


3. Solve for u: 


u"(t) — 3u’(t) + 2u(t) = t? — cost, t> 0, 
u0+) = uw(0+) = v’(0+) = 0 


4. Let up: (0, 00) + C be given by 
n-1 

Uo(t) = > (kl) byt". 
k=0 


Show that 
D'u(0+)=b,, O<sk<n-—1. 
5. Suppose uy: (0, 00) — R is such that 
D'u(0+)=b, O<k<en-1. 


Show that u is a solution of (22)-(23) if and only if u = up + u,, where u, is 
the solution of 


P(D)u,(t) = f(t) — p(D)ud(t), t > 0, 
D'u(0+)=0, O<sk<en-1. 


6. Show that problem (22)-(23) has a unique solution. 
7. Show that the solution of 


P(D)yu(t)=0, t>0, 
D*u(0+) = 0, O0O<k<j and j<k<n-l, 
Diu(O+) = 1 


is a linear combination of functions t* exp zt. 
8. Show that any solution of 


P(D)u(t) = 0, t>0 


is a linear combination of the functions t* exp zt, where z is a root of p(D) 
with multiplicity greater than k, and conversely. 

9. Suppose u: (0,00) > C is smooth and suppose D*u(0+) exists for 
each k. Suppose also that each D*u defines a distribution F;, by 


F,(v) = [ D*u(t)v(t) dt. 


Show that 


DF, = Fy, + u(0+)6, 
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and in general 

k=1 

D'Fy = F, + > D'u(0+)D*-1~%, 
7=0 
10. In Exercise 9 let u(t) = G(t), t > 0, where G is the Green’s function 
for p(D). Show that 
P(D)Fo = 6. 


11. Use Exercise 9 to interpret the problem (22)-(23) as a problem of 
finding a distribution (when the function f defines a distribution in #’). 
Discuss the solution of the problem. 

12. Use Exercise 11 to give another derivation of the result of Exercise 5. 

13. Again let 


a(t) = u(—ft), Twu(t) = u(t — s). 
If Fe Y’ anducF, set 
Fx u(t) = F(T_,wu). 


(a) Suppose F = F,, where v: R — C is continuous, v(t) = Ofort < —M, 
and e~“v(t) is bounded. Show that for each u € Y’ the convolution integral 


5 « u(t) = | a(t — s)u(s) ds 
exists and equals 
F « u(t). 


(b) Show that for each Fe #' and ue & the function F' « u is in Z 
14. If F, He &, set 


(Fx Hu) = F(H*«u), ueZ 


Show that Fx He &’. 
15. Compute (D"s)~ * u, ue & Compute (D*8) « F, Fe &’. 
16. Show that 


L(F * H) = L(F)L(H). 


17. Let G be the distribution determined by the Green’s function for 
p(D). Show that 


LG = p(z)7. ° 
18. Show that the solution of 
P(D)F = H 
1S 
F=Gx4dH, 


where G is as in Exercise 17. 


NOTES AND BIBLIOGRAPHY 


Chapters I and 2. The book by Kaplansky [9] is a very readable source of 
further material on set theory and metric spaces. The classical book by Whittaker 
and Watson [25] and the more modern one by Rudin [18] treat the real and 
complex number systems, compactness and continuity, and the topics of Chapter 
2. Vector spaces, linear functionals, and linear transformations are the subject of 
any linear algebra text, such as Halmos [7]. Infinite sequences and series may be 
pursued further in the books of Knopp [10], [11]. More problems (and theorems) 
in analysis are to be found in the classic by Polya and Szegé [15]. 


Chapters 3,4,and5. The Weierstrass theorems (and the technique of approx- 
imation by convolution with an approximate identity) are classical. A direct proof 
of the polynomial approximation theorem and a statement and proof of Stone’s 
generalization may be found in Rudin [18]. 

The general theory of distributions (or “‘generalized functions’’) is due to 
Laurent Schwartz, and is expounded in his book [20]. The little book by Lighthill 
[12] discusses periodic distributions and Fourier series. Other references for dis- 
tribution theory and applications are the books of Bremermann [2], Liverman [13], 
Schwartz [21], and Zemanian [27]. 

Banach spaces, Frechet spaces, and generalizations are treated in books on 
functional analysis: that by Yosida [26] is comprehensive; the treatise by Dunford 
and Schwartz [4] is exhaustive; the sprightly text by Reed and Simon [16] is 
oriented toward mathematical physics. Good sources for Hilbert space theory in 
particular are the books by Halmos [6], [8] and by Riesz and Sz.-Nagy [17]. 

The classical L?-theory of Fourier series treats L? (0, 277) as a space of functions 
rather than as a space of distributions, and requires Lebesgue integration. Chap- 
ters 11 through 13 of Titchmarsh [24] contain a concise development of Lebesgue 
integration and the L?-theory. A more leisurely account is in Sz.-Nagy [14]. The 
treatise by Zygmund [28] is comprehensive. 


Chapter 6. The material in §1-§6 is standard. The classic text by Titchmarsh 
[24] and that by Ahlfors [1] are good general sources. The book by Rudin [19] 
also treats the boundary behavior of functions in the disc, related to the material 
in §7. 


Chapter 7. The Laplace transform is the principal subject of most books on 
** operational mathematics’ and “‘transform methods.’ Doetsch [3] is a compre- 
hensive classical treatise. Distribution-theoretic points of view are presented in 
the books of Bremermann [2], Erdelyi [5], Liverman [13], and Schwartz [21]. 
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